1. Introduction
Anger can shape economic outcomes. Consider three cases: 


Case 1: In 2015 Turing Pharmaceuticals raised the price of Daraprim, a therapeutic drug, from $12 to $750 per dose. The 
company was subsequently accused of price gouging. Should Turing have considered the consequences of customer 
anger before determining the new price for the drug? 

Case 2: When local football teams favored to win instead lose, the police get more reports of husbands assaulting wives 
(Card and Dahl, 2011). Do unexpected losses spur vented frustration? 

Case 3: Following the sovereign debt crises that began in 2009, some EU countries embarked on austerity programs. Was it 
because citizens lost benefits that some cities experienced riots? 


☆ This paper modifies and extends Smith (2009). Pierpaolo Battigalli gratefully acknowledges financial support from ERC advanced grant 324219. We
thank Chiara Aina, Federico Bobbio, Roberto Corrao, Carlo Cusumano, Giacomo Lanzani, Paolo Leonetti, Paola Moscariello, and Marco Stenborg Petterson
for excellent research assistance, and Doug Bernheim, Steve Brams, Leda Cosmides, Vince Crawford, Nicodemo De Vito, Uri Gneezy, Pierfrancesco Guarino,
Michele Griessmair, Heike Hennig-Schmidt, Botond Kőszegi, Alex Imas, Joshua Miller, David Rietzke, Julio Rotemberg, Emanuel Vespa, several reviewers, and
many seminar and conference audiences for helpful comments.

* Corresponding author. 

E-mail addresses: pierpaolo.battigalli@unibocconi.it (P. Battigalli), martind@eller.arizona.edu (M. Dufwenberg), alecsmith@vt.edu (A. Smith). 


https://doi.org/10.1016/j.geb.2019.06.001
0899-8256/© 2019 Elsevier Inc. All rights reserved. 


16 P. Battigalli et al. / Games and Economic Behavior 117 (2019) 15-39 


Pricing, domestic violence, political landscapes: these are important themes. We propose that others (involving—say— 
recessions, contracting, arbitration, terrorism, road rage, or support for populists) could plausibly be imagined. However, to 
assess the impact of anger on social and economic interactions, one needs a theory that predicts outcomes based on the 
decision-making of anger-prone individuals and that accounts for strategic considerations. We develop such a theory and 
focus on leader-follower (LF-) games with two players, one of whom moves first, while the other observes and reacts. 

Insights from psychology about the triggers and repercussions of anger are evocative. The behavioral consequences of 
emotions are called “action tendencies,” and the action tendency of anger is aggression and urge to retaliate. Angry players 
may be willing to forgo material gains to punish others, or be predisposed to aggression when this serves as a credible 
threat. But while insights of this nature can be gleaned from psychologists’ writings, their analysis usually stops with 
the individual rather than going on to assess overall economic implications. We take the basic insights about anger that 
psychology has produced as input and inspiration.¹

We study the strategic interaction of decision makers who become angry when they are frustrated.² Frustration occurs
when someone is unexpectedly denied something he or she cares about. We assume that people are frustrated when they
get lower material rewards than expected.³ They then become hostile towards whomever they blame. Because a player's
frustration depends on his beliefs about others’ choices, and the blame a player attributes to another may depend on his 
beliefs about others’ choices or beliefs, all our models find their intellectual home in the framework of psychological game 
theory; see Geanakoplos et al. (1989) and Battigalli and Dufwenberg (2009) (B&D). 

Initially expected material payoffs are the reference point to which outcomes are compared to generate frustration. 
This modeling choice mirrors several other behavioral models, including Kőszegi and Rabin’s (2006, 2007) model of
reference-dependent preferences, guilt aversion (Dufwenberg, 2002, Battigalli and Dufwenberg, 2007), and earlier models 
of disappointment aversion (Bell, 1985, Loomes and Sugden, 1986). While this approach may not incorporate every aspect 
of frustration discussed in the psychology literature, it is consistent with a range of phenomena, allowing for frustration and 
anger to be belief-dependent and capturing the stylized fact that costly punishment can violate consequentialism (e.g., Falk 
et al., 2003, 2008). 

There are many ways to model blame.⁴ We present three approaches that result in distinct utilities. Players motivated by
simple anger (SA) become generally hostile when frustrated. In contrast, those motivated by anger from blaming behavior
(ABB) or anger from blaming intentions (ABI) are more discriminating, asking who caused, or intended to cause, their
frustration. SA captures the psychological phenomenon of displaced aggression, where an angry person takes out frustration
on a blameless bystander.⁵ However, for some authors, blame or other-responsibility is a prerequisite for anger.⁶ ABB and
ABI capture this requirement.

Players have beliefs about both others’ beliefs and actions as well as their own actions. Because frustration results 
from diminished expectations, first-movers are never frustrated at the root and maximize their expected material payoffs. 
We define and establish the existence of a notion of sequential equilibrium (SE) that adapts to our framework the one 
developed in B&D. In pure-strategy SE, frustration arises only off the equilibrium path, and furthermore, in generic perfect 
information game forms, there is always an SE with anger that is realization-equivalent to the material-payoff equilibrium, 
though anger may also result in additional equilibria. In LF-games, a follower with SA, ABB, or ABI always does at least as 
well, materially, as a player who is not anger-prone.⁷ We also formally develop the notion of a threat in order to provide a
partial characterization of SE with anger: the presence of threats allows anger-prone followers to obtain more than in the 
material-payoff equilibrium and give less to the leader, while their absence implies that equilibria with SA, ABB, or ABI are 
equivalent to the material payoff equilibrium. 

Our models can encapsulate Cases 1 and 2 above (for Case 3, cf. Passarelli and Tabellini, 2017). Case 1 is captured by an 
ultimatum minigame (an example of an LF-game), where SA and ABB allow for a pure SE involving rejection of the greedy 
offer, and all our concepts allow for an SE where rejection occurs with positive probability. Case 2 is illustrated with SA in 


1 The psychology literature is huge. A source of inspiration is International Handbook of Anger (Potegal et al., 2010) offering a cross-disciplinary perspective 
reflecting “affective neuroscience, business administration, epidemiology, health science, linguistics, political science, psychology, psychophysiology, and 
sociology” (p. 3). The absence of “economics” in the list may indicate that our approach is original! 

2 A large body of work in psychology connects frustration, anger, and aggression, beginning with Dollard et al. (1939). See, for example, Averill (1982), 
Berkowitz (1989), and the (op. cit.) Handbook, especially the chapters by Lewis, Wranik and Scherer, and by Berkowitz. 

3 That frustration depends on expectations is well supported in the psychology literature, e.g., Berkowitz (1978, p. 697): “Unlike deprivations, frustrations 
can only be surprising (to a greater or lesser extent). People who do not expect to reach their goals are not anticipating the pleasure these goals would 
bring. Their hopes are not dashed if they have no hopes.” Our focus on material rewards is admittedly restrictive. See the Discussion in Section 5. 

4 See, e.g. Alicke (2000), Battigalli and Dufwenberg (2007), and Halpern (2016, Chapter 6). 

5 See Marcus-Newhall et al. (2000). 

6 See, e.g., the chapter by Wranik and Scherer in the (op. cit.) Handbook.

7 Taking into account that evolutionary pressure is driven by material payoffs (e.g., Buss, 2016), this result is consistent with the work of Sell et al. (2009), 
who argue that anger is the result of a process of natural selection for behaviors that resolve bargaining conflicts in favor of the anger-prone individual. 
an example that we call “hammering-one’s-thumb.”⁸ In contrast, once notions of blame are incorporated into the analysis,
both ABB and ABI eliminate such displaced aggression.

A small literature examines the role of anger in economic behavior. Selten (1978) discusses the implications of the
frustration-aggression hypothesis of Dollard et al. (1939) for behavior in the chain store game, though he does not
develop a model.⁹ Other early work exploring the role of anger in solving commitment problems includes Hirshleifer (1987),
Frank (1988), and Elster (1998). Most recent studies are empirical or experimental, indicative of hostile action occurring
in economic situations, based on either observational data or experimental data.¹⁰ A few studies present theories different
from ours, including Rotemberg (2005, 2008, 2011), Brams (2011), Winter (2014), Winter et al. (2016), Akerlof (2016), and
Passarelli and Tabellini (2017).

We present our three models in Section 2. Section 3 contains most of our analysis, defining SE and exploring results 
focused mainly on the class of LF-games. Section 4 compares our approach to others (those mentioned above, plus some 
models of distributional preferences and reciprocity). Section 5 contains a broad concluding discussion. Proofs are collected 
in the Appendix. 


2. Three models 
2.1. Preliminaries


Game form We restrict attention to finite two-stage game forms with observed actions, meaning that first-period choice
profiles become public information at the beginning of the second period. The set of players is $I$. To ease notation, we
assume that all players take actions simultaneously at each stage. Thus, nodes are histories $h$ of action profiles
$a^t = (a_i^t)_{i \in I}$ observed by all; $h = \varnothing$ is the empty history (the root), $h = (a^1)$ a history of length one,
which may be terminal or not, and $h = (a^1, a^2)$ a history of length 2, which is terminal. $H$ is the set of nonterminal
histories and $Z$ is the set of terminal histories (end nodes). The set of feasible actions of $i$ after $h \in H$ is $A_i(h)$.
This set is a singleton if $i$ is not active given $h$. Thus, for all $h \in H$, $I(h) = \{i \in I : |A_i(h)| > 1\}$ is the set of
active players following $h$. Given the observed actions property, if $I(h)$ is a singleton for each $h \in H$ then the game
has perfect information. We omit parentheses whenever no confusion may arise. For example, we may write $h = a^1$ instead
of $h = (a^1)$, and $h = (a_i^1, a_j^2)$ if $i$ (resp. $j$) is the only first (resp. second) mover. Finally, we let
$A(h) = \times_{i \in I} A_i(h)$ and $A_{-i}(h) = \times_{j \neq i} A_j(h)$. The material consequences of players’ actions are
determined by a profile of monetary payoff functions $(\pi_i : Z \to \mathbb{R})_{i \in I}$. A perfect information game has no
relevant ties if distinct terminal histories yield different payoffs for the player who is active at the longest common
prefix.¹¹ For two-stage games, this means that different actions of the first mover lead to different material payoffs for
the first mover, and different actions of a second mover lead to different material payoffs for this second mover. If the
game contains chance moves, we augment the player set with a dummy player $c$ (with $c \notin I$), who selects a feasible
action at random. Thus, let $I_c = I \cup \{c\}$, and the sets of first and second movers may include $c$:
$I(\varnothing), I(a^1) \subseteq I_c$. If the chance player is active at $h \in H$, its move is described by probability mass
function $\sigma_c(\cdot | h) \in \Delta(A_c(h))$.
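The no-relevant-ties condition for two-stage games with perfect information can be checked mechanically. The following Python sketch is only an illustration: the nested-dictionary representation of the game tree and the name `has_no_relevant_ties` are assumptions made here, not notation from the paper. Payoff pairs list the first mover's payoff first, and the first example uses the payoffs of the ultimatum minigame of Fig. 1, with a forced `"accept"` standing in for the second mover's singleton action set after the fair offer.

```python
from itertools import combinations

def has_no_relevant_ties(tree):
    """Check the no-relevant-ties condition for a two-stage perfect-information game.

    `tree` maps each first-mover action to a dict of second-mover actions,
    each mapped to a payoff pair (first mover, second mover).
    """
    # Enumerate terminal histories as (first action, second action, payoffs).
    terminals = [(a1, a2, pay) for a1, sub in tree.items() for a2, pay in sub.items()]
    for (a1, _, p), (b1, _, q) in combinations(terminals, 2):
        if a1 != b1:
            # Longest common prefix is the root: the first mover must not be tied.
            if p[0] == q[0]:
                return False
        else:
            # Same first action, different second action: the second mover must not be tied.
            if p[1] == q[1]:
                return False
    return True

# Payoffs of the ultimatum minigame (Fig. 1).
ultimatum = {"f": {"accept": (2, 2)}, "g": {"y": (3, 1), "n": (0, 0)}}
# Hypothetical variant in which the first mover is tied between f and (g, y).
tied = {"f": {"accept": (2, 2)}, "g": {"y": (2, 1), "n": (0, 0)}}

print(has_no_relevant_ties(ultimatum), has_no_relevant_ties(tied))
```

In the second game form the first mover receives 2 both after f and after (g, y), so the condition fails even though the second mover is never tied.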


Personal histories To model how $i$ determines the subjective value of feasible actions, we add to the commonly observed
histories $h \in H$ also personal histories of the form $(h, a_i)$, with $a_i \in A_i(h)$. In a game with perfect information,
$(h, a_i) \in H \cup Z$. But if there are simultaneous moves at $h$, then $(h, a_i)$ is not a history in the standard sense. As
soon as $i$ irreversibly chooses action $a_i$, he observes $(h, a_i)$, and can determine the value of $a_i$ using his beliefs
conditional on this event ($i$ knows in advance how he is going to update his beliefs conditional on what he observes). We
denote by $H_i$ the set of histories of $i$, standard and personal, and by $Z(h_i)$ the set of terminal successors of $h_i$.¹²
The standard precedence relation $\prec$ for histories in $H \cup Z$ is extended to $H_i$ in the obvious way: for all $h \in H$,
$i \in I(h)$, $a_i \in A_i(h)$, and $a_{-i} \in A_{-i}(h)$ it holds that $h \prec (h, a_i)$ and, furthermore,
$(h, a_i) \prec (h, (a_i, a_{-i}))$ if $i$ is not the only active player at $h$. Note that $h_i \prec h_i'$ implies
$Z(h_i') \subseteq Z(h_i)$, with strict inclusion if $h_i \in H$ and at least one player (possibly $c$) is active at $h_i$.


8 The example is inspired by Frijda (1993), who says “Many experiences or responses of anger... are elicited by events that involve no blameworthy 
action” and suggests that simple frustrations such as “one’s car refusing to start, finding one’s bicycle has a flat tyre, rain on the fifth day of one’s holiday 
after four previous days of rain... hitting one’s head on the kitchen shelf, dropping a needle for the third time in a row” or “hammering one’s thumb” may 
result in anger. He goes on to say that “the target of anger may be a person who has fallen ill on the day of one’s party, or one who just happened to be 
present when a plan failed”. 

9 The chain store stage game is strategically equivalent to the ultimatum minigame. 

10 See Anderson and Simester (2010) and Rotemberg (2005, 2011) on pricing; Card and Dahl (2011) and Munyo and Rossi (2013) on violence; Carpenter
and Matthews (2012), Gurdal et al. (2014), Gneezy and Imas (2014), Persson (2018), van Leeuwen et al. (2018), Aina et al. (2018), and Dufwenberg et al.
(2018a, 2018b) for experiments.

11 See Battigalli (1997). It can be checked that the property is generic with respect to material payoff functions $\pi \in \mathbb{R}^{Z \times I}$: the
closure of the set of $\pi$ that do not satisfy it has Lebesgue measure 0 in $\mathbb{R}^{Z \times I}$.

12 That is, $H_i = H \cup \{(h, a_i) : h \in H, i \in I(h), a_i \in A_i(h)\}$. The definition of $Z(h_i)$ is standard for $h_i \in H$; for
$h_i = (h, a_i)$ we have $Z(h, a_i) = \bigcup_{a_{-i} \in A_{-i}(h)} Z(h, (a_i, a_{-i}))$.
Beliefs It is conceptually useful to distinguish three aspects of a player’s beliefs: beliefs about co-players’ actions, beliefs
about co-players’ beliefs, and the player’s plan, which consists of beliefs about own actions. Beliefs are defined conditional
on each $h_i \in H_i$. Abstractly denote by $\Delta_{-i}$ the space of co-players’ beliefs (the formal definition is given below).
Player $i$’s beliefs can be described as conditional probability measures over paths and beliefs of others, i.e., over
$Z \times \Delta_{-i}$. Events, from $i$’s point of view, are subsets of this space: events about behavior take the form
$Y \times \Delta_{-i}$, with $Y \subseteq Z$; events about beliefs take the form $Z \times E_{\Delta_{-i}}$, with
$E_{\Delta_{-i}} \subseteq \Delta_{-i}$. Here we provide an abridged and somewhat informal description of beliefs that is
sufficient to understand the main text. The Appendix contains the formal analysis used in proofs.


First-order beliefs For each $h_i \in H_i$, player $i$ holds beliefs $\alpha_i(\cdot | h_i) \in \Delta(Z(h_i))$ about the actions that
will be taken in the continuation of the game. The system of beliefs $\alpha_i = (\alpha_i(\cdot | h_i))_{h_i \in H_i}$ must satisfy
two properties. First, the rules of conditional probabilities hold whenever possible, that is, $\alpha_i$ satisfies the chain
rule of conditional probabilities (Eq. (7), see the Appendix). Second, if at history $h \in H$ player $i$ moves simultaneously
with other players, what $i$ believes about the simultaneous actions of co-players is independent of his action, that is,
such marginal probabilities are the same conditional on $h$ and on $(h, a_i)$, for every $a_i \in A_i(h)$. We call this natural
property own-action independence (see Eq. (8) in the Appendix).

To ease notation, for each $h \in H$ and each action profile $a = (a_i, a_{-i}) \in A(h)$, let $\alpha_i(a | h)$,
$\alpha_{i,i}(a_i | h)$, and $\alpha_{i,-i}(a_{-i} | h)$ respectively denote the (marginal) conditional probabilities assigned by
$\alpha_i$ to $a$, $a_i$, and $a_{-i}$. The chain rule and own-action independence imply

$$\alpha_i(a_i, a_{-i} | h) = \alpha_{i,i}(a_i | h) \, \alpha_{i,-i}(a_{-i} | h).$$

Thus, $\alpha_i$ is made of two parts, what $i$ believes about own behavior and about the behavior of others. The array of
probability measures $\alpha_{i,i} \in \times_{h \in H} \Delta(A_i(h))$ is, technically speaking, a behavior strategy, and we
interpret it as $i$’s plan. The reason is that the result of $i$’s contingent planning is precisely a system of conditional
beliefs about what action he would take at each history. If there is only one co-player, also
$\alpha_{i,-i} \in \times_{h \in H} \Delta(A_{-i}(h))$ corresponds to a behavior strategy. With multiple co-players,
$\alpha_{i,-i}$ corresponds instead to a “correlated behavior strategy.” Whatever the case, $\alpha_{i,-i}$ gives $i$’s
conditional beliefs about others’ behavior, and these beliefs may not coincide with the plans of others. We emphasize: a
player’s plan does not describe actual choices; actions on the path of play are the only actual choices. A conditional
belief system $\alpha_i$ satisfying the chain rule and own-action independence is a first-order belief of $i$. Let
$\Delta_i^1$ denote the space of such beliefs. It can be checked that $\Delta_i^1$ is a compact metric space, hence the same
holds for $\Delta_{-i}^1 = \times_{j \neq i} \Delta_j^1$, the space of co-players’ first-order belief profiles.


Second-order beliefs Players also hold beliefs about the beliefs of co-players. In the following analysis, the only co-players’
beliefs affecting the values of actions are their first-order beliefs. Therefore, we limit our attention to second-order
beliefs, i.e., systems of conditional probability measures
$\beta_i = (\beta_i(\cdot | h_i))_{h_i \in H_i} \in \times_{h_i \in H_i} \Delta(Z(h_i) \times \Delta_{-i}^1)$ that satisfy the chain rule
(Eq. (9), in the Appendix) and the appropriate version of the own-action independence property for second-order beliefs:
given any history $h \in H$, what $i$ believes about the simultaneous action of co-players and their first-order belief
systems conditional on $h$ and conditional on $(h, a_i)$ is the same for every $a_i \in A_i(h)$ (see Eq. (10) in the Appendix).
The space of such second-order belief systems of $i$ is denoted by $\Delta_i^2$. It can be checked that the marginalization
onto $Z$ of any $\beta_i \in \Delta_i^2$ yields a system of first-order beliefs satisfying the chain rule and own-action
independence, that is, an element $\alpha_i$ of $\Delta_i^1$. This $\alpha_i$ is the first-order belief implicit in $\beta_i$.
Whenever we write in a formula beliefs of different orders for $i$, we assume that $\alpha_i$ is derived from $\beta_i$;
otherwise beliefs of different orders would not be mutually consistent. Also, we may omit the empty history, as in
$\beta_i(E) = \beta_i(E | \varnothing)$ or $\alpha_i(a) = \alpha_i(a | \varnothing)$.


Conditional expectations Let $\psi_i$ be any real-valued measurable function of variables that $i$ does not know, e.g., the
terminal history or the co-players’ first-order beliefs. Then $i$ can compute the expected value of $\psi_i$ conditional on
any $h_i \in H_i$ by means of his belief system $\beta_i$, denoted $E[\psi_i | h_i; \beta_i]$. If $\psi_i$ depends only on actions,
i.e., on the path $z$, then $E[\psi_i | h_i; \beta_i]$ is determined by the $\alpha_i$ derived from $\beta_i$, and we can write
$E[\psi_i | h_i; \alpha_i]$. In particular, $\alpha_i$ gives the conditional expected material payoffs:

$$E[\pi_i | h; \alpha_i] = \sum_{z \in Z(h)} \alpha_i(z | h) \, \pi_i(z),$$

$$E[\pi_i | (h, a_i); \alpha_i] = \sum_{z \in Z(h, a_i)} \alpha_i(z | h, a_i) \, \pi_i(z)$$

for all $h \in H$, $a_i \in A_i(h)$. $E[\pi_i | h; \alpha_i]$ is what $i$ expects to get conditional on $h$ given $\alpha_i$, which
also specifies $i$’s plan. $E[\pi_i | (h, a_i); \alpha_i]$ is $i$’s expected payoff of action $a_i$ given $h$. If $a_i$ is what $i$
planned to choose at $h$, then $\alpha_{i,i}(a_i | h) = 1$ and $E[\pi_i | h; \alpha_i] = E[\pi_i | (h, a_i); \alpha_i]$. For initial
beliefs, we omit $h = \varnothing$, writing $E[\pi_i; \alpha_i]$ for $i$’s initially expected material payoff.
Table 1 summarizes our framework. 
Table 1
Elements of the two-stage game form.

Notation                                  Terminology
$i \in I$                                 players
$h \in H$                                 non-terminal, or partial histories
$I(h) \subseteq I$                        set of active players at $h$
$t \in \{1, 2\}$                          stages, or periods
$A_i(h)$, $A(h)$, $A_{-i}(h)$             set of actions and action profiles at $h$
$a_i^t$                                   action of $i$ in stage $t$
$a^t$ ($a_{-i}^t$)                        action profile (of others) in stage $t$
$z \in Z$                                 terminal histories
$Z(h)$                                    terminal successors of $h$
$\pi_i : Z \to \mathbb{R}$                monetary payoff function of $i \in I$
$\alpha_i$, $\alpha_{-i}$, $\alpha$       first-order beliefs and belief profiles
$\beta_i$, $\beta_{-i}$, $\beta$          second-order beliefs and belief profiles





2.2. The frustration-aggression hypothesis and three ways to blame 


Anger is triggered by frustration. While we focus on anger as a social phenomenon (frustrated players blame, become
angry with, and care for the payoffs of others), our account of frustration refers to own payoffs only.¹³ We define player
$i$’s frustration at history $h$, given his first-order belief system $\alpha_i$, as

$$F_i(h; \alpha_i) = \left[ E[\pi_i; \alpha_i] - \max_{a_i \in A_i(h)} E[\pi_i | (h, a_i); \alpha_i] \right]^+,$$

where $[x]^+ = \max\{x, 0\}$. In words, frustration is given by the gap, if positive, between $i$’s initially expected payoff
and the currently best expected payoff he believes he can obtain. Diminished expectation, that is,
$E[\pi_i | h; \alpha_i] < E[\pi_i; \alpha_i]$, is only a necessary condition for frustration. For $i$ to be frustrated it must
also be the case that $i$ cannot close the gap.

At the root, frustration must be zero; nothing happened, hopes can’t be dashed:


Remark 1. For every player $i \in I$ and system of first-order beliefs $\alpha_i \in \Delta_i^1$, frustration must equal 0 at the
initial history $h = \varnothing$, because the chain rule and own-action independence imply

$$E[\pi_i; \alpha_i] = \sum_{a_i \in A_i(\varnothing)} \alpha_{i,i}(a_i | \varnothing) \, E[\pi_i | a_i; \alpha_i] \leq \max_{a_i' \in A_i(\varnothing)} E[\pi_i | a_i'; \alpha_i].$$
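The definition of frustration and the argument behind Remark 1 can be illustrated numerically. The sketch below is a minimal Python illustration under stated assumptions: the function name `frustration` and all numbers are hypothetical and chosen only for the example. It computes the positive part of the gap between the initially expected payoff and the best continuation value, and checks that the gap cannot be positive at the root, where the initial expectation is a plan-weighted average of the action-conditional expectations.

```python
def frustration(initial_expected, continuation_values):
    """Positive part of E[pi_i; alpha_i] minus the best E[pi_i | (h, a_i); alpha_i]."""
    return max(initial_expected - max(continuation_values), 0.0)

# Hypothetical root: i's expected payoff after each of his own first-stage
# actions, and i's plan (probabilities he assigns to those actions).
values_at_root = [2.0, 0.5, 1.25]
plan = [0.5, 0.25, 0.25]

# Chain rule: the initial expectation is the plan-weighted average of the
# action-conditional expectations, so it cannot exceed their maximum.
initial = sum(p * v for p, v in zip(plan, values_at_root))
root_frustration = frustration(initial, values_at_root)

# Hypothetical second-stage history at which the best available
# continuation payoff (1.0) falls short of an initial expectation of 2.0.
later_frustration = frustration(2.0, [1.0, 0.0])

print(initial, root_frustration, later_frustration)
```

The second call shows the other half of the definition: a diminished expectation generates frustration only when no available action closes the gap.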


Frustration is possible at the end nodes, but can’t influence subsequent choices as the game is over. One might allow the
anticipation of frustration to be felt at end nodes to influence earlier decisions; however, the assumptions we make below
rule this out. In our 2-stage setting, all behaviorally relevant frustration occurs in the second stage and is given by

$$F_i(a^1; \alpha_i) = \left[ E[\pi_i; \alpha_i] - \max_{a_i^2 \in A_i(a^1)} E[\pi_i | (a^1, a_i^2); \alpha_i] \right]^+.$$

Preferences at a given node depend on expected material payoffs and frustration. A frustrated player is motivated to hurt
others, if this is not too costly (cf. Dollard et al., 1939; Averill, 1982; Berkowitz, 1989). We consider versions of this
frustration-aggression hypothesis related to different cognitive appraisals of blame. In general, player $i$ moving at $h$
chooses action $a_i$ to maximize the expected value of a belief-dependent “decision utility” of the form

$$u_i(h, a_i; \beta_i) = E[\pi_i | (h, a_i); \alpha_i] - \theta_i \sum_{j \neq i} B_{ij}(h; \beta_i) \, E[\pi_j | (h, a_i); \alpha_i], \tag{1}$$

where $\alpha_i$ is derived from $\beta_i$, and $\theta_i \geq 0$ is a sensitivity parameter. Thus, $B_{ij}(h; \beta_i) \geq 0$ measures
how much of $i$’s frustration is blamed on $j$, and the presence of $E[\pi_j | (h, a_i); \alpha_i]$ in the formula translates
this into a tendency to hurt $j$. We assume that

$$B_{ij}(h; \beta_i) \leq F_i(h; \alpha_i). \tag{2}$$


Because there is no frustration in the first stage, the following is true. 


13 In Section 5 (in hindsight of definitions to come) we discuss this approach in depth.
[Figure 1: game tree. Ann (a) chooses f, ending the game with payoffs (2, 2), or g, after which Bob (b) chooses n, with payoffs (0, 0), or y, with payoffs (3, 1).]

Fig. 1. Ultimatum minigame.


Remark 2. The decision utility of a first-mover coincides with expected material payoff: for all $i \in I(\varnothing)$,
$a_i \in A_i(\varnothing)$, $\beta_i \in \Delta_i^2$, and $\alpha_i \in \Delta_i^1$ derived from $\beta_i$,

$$u_i(\varnothing, a_i; \beta_i) = E[\pi_i | a_i; \alpha_i].$$


Simple anger (SA) Our most rudimentary hypothesis is that $i$’s tendency to hurt others is proportional to $i$’s frustration.
SA is unmodulated by the cognitive appraisal (i.e., personal interpretation) of blame, so
$B_{ij}(h; \beta_i) = F_i(h; \alpha_i)$, and we get the following decision utility:

$$u_i^{SA}(h, a_i; \alpha_i) = E[\pi_i | (h, a_i); \alpha_i] - \theta_i \sum_{j \neq i} F_i(h; \alpha_i) \, E[\pi_j | (h, a_i); \alpha_i]. \tag{3}$$
j#i 

We demonstrate SA via the ultimatum minigame in Fig. 1 (Gale et al., 1995), a simplified version of the ultimatum game
of Güth et al. (1982). Its structure is similar to that of the stage game of Selten’s (1978) chain-store game. As noted by
Gale et al. (1995) (p. 76), models such as ours which imply that players will reject unfair offers “provide a possible
resolution of the chain-store paradox that applies even in the case when there is just one potential entrant.” Selten also
considers that frustration and aggression may be relevant. Vis-a-vis Case 1 in Section 1, the setup can also be interpreted
as representing a monopoly seller (Ann) who can offer Bob either a high or a low split of the gains from trade.


Example 1. Ann and Bob ($a$ and $b$ in Fig. 1) negotiate: Ann can make a fair offer $f$, which is automatically accepted, or a
greedy offer $g$, which Bob can accept ($y$) or reject ($n$). After $g$, Bob’s frustration is

$$F_b(g; \alpha_b) = \left[ (1 - \alpha_b(g)) \cdot 2 + \alpha_b(g) \, \alpha_b(y | g) \cdot 1 - 1 \right]^+.$$

Therefore

$$u_b^{SA}(g, n; \alpha_b) - u_b^{SA}(g, y; \alpha_b) = 3 \theta_b \left[ 2 (1 - \alpha_b(g)) + \alpha_b(g) \, \alpha_b(y | g) - 1 \right]^+ - 1.$$

For Bob to be frustrated he must not expect $g$ with certainty. The less he expects $g$, and (interestingly) the less he plans
to reject, the more prone he is to reject. The more resigned Bob is to getting a low payoff, the less frustrated and prone to
aggression he is. Furthermore, it is readily seen how our model can generate non-consequentialist behavior. Holding beliefs
and other payoffs constant, increasing Bob’s payoff from $f$ will lead to greater frustration after $g$, and so increases the
disutility Bob receives from Ann’s material payoff. This makes rejection (punishment of Ann) more attractive.
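The comparative statics of Example 1 can be traced with a few lines of Python. This is a sketch under stated assumptions: the function names and the parameter values are hypothetical, and the payoffs are those of Fig. 1, namely (2, 2) after f, (3, 1) after (g, y), and (0, 0) after (g, n), with Ann's payoff listed first.

```python
def bob_frustration(prob_g, prob_y_given_g):
    """F_b(g; alpha_b): Bob's frustration after the greedy offer g."""
    # Bob's initially expected payoff: 2 after f, 1 after (g, y), 0 after (g, n).
    initial = (1 - prob_g) * 2 + prob_g * prob_y_given_g * 1
    best_after_g = 1  # accepting gives Bob 1, rejecting gives him 0
    return max(initial - best_after_g, 0.0)

def sa_gain_from_rejecting(theta_b, prob_g, prob_y_given_g):
    """u_b^SA(g, n) - u_b^SA(g, y), which equals 3 * theta_b * F_b - 1."""
    F = bob_frustration(prob_g, prob_y_given_g)
    u_reject = 0 - theta_b * F * 0  # payoffs (0, 0): Ann gets 0
    u_accept = 1 - theta_b * F * 3  # payoffs (3, 1): Ann gets 3
    return u_reject - u_accept

# Bob expected the fair offer for sure: maximal frustration, rejection is preferred.
print(sa_gain_from_rejecting(0.5, 0.0, 1.0))
# Bob expected g with probability 1/2 and planned to accept: he accepts.
print(sa_gain_from_rejecting(0.5, 0.5, 1.0))
# Bob expected g with probability 1/2 and planned to reject: no frustration at all.
print(sa_gain_from_rejecting(0.5, 0.5, 0.0))
```

With Bob certain of the fair offer, frustration equals 1 and rejection is strictly preferred exactly when the sensitivity parameter exceeds 1/3. The last two calls show the point made in the text: a Bob who plans to reject expects less at the outset, so he is less frustrated after g.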














In the example, Bob rejects because he is truly angry and prefers n to y. He is not signaling his type to deter Ann from 
choosing g in the future. This marks a difference with “reputational models.” In the example, there is no future behavior 
for Bob to influence. 

Under SA a frustrated player goes after others rather indiscriminately. The word “rather” is justified because, as regards 
targets of aggression, our modeling of SA restricts attention to co-players, implicitly saying that persons who are not repre- 
sented in a game are not targets. So the modeler has a responsibility to represent the appropriate environment. If another 
individual is a relevant target (e.g., Bob’s wife in addition to Ann), that person should be included (e.g., as a dummy player). 
The exact determination of whom to include in the games is an empirical question. We have the qualitative idea that SA 
allows for innocent targets, and this is what we model. Future research may generate more nuanced insights. 

We next consider models where targets of aggression must be less innocent. 


Anger from blaming behavior (ABB) Action tendencies may depend on a player’s cognitive appraisal of how to blame. When
a frustrated player $i$ blames co-players for their behavior, he examines the actions chosen in stage 1, without considering
others’ intentions: How much $i$ blames $j$ is defined by a function that specifies blame $B_{ij}(a^1; \alpha_i)$ that depends on
$\alpha_i$ (but not $\beta_i$). Different functional forms are conceivable, but we will here focus on the following one: When
frustrated, $i$ considers, for each $j$, what he would have obtained at most, in expectation, had $j$ chosen differently:

$$\max_{a_j' \in A_j(\varnothing)} E[\pi_i | (a_j', a_{-j}^1); \alpha_i].$$

If this could-have-been payoff is more than what $i$ currently expects (that is, $E[\pi_i | a^1; \alpha_i]$), then $i$ blames $j$,
up to $i$’s frustration (so (2) holds):

$$B_{ij}(a^1; \alpha_i) = \min\left\{ \left[ \max_{a_j' \in A_j(\varnothing)} E[\pi_i | (a_j', a_{-j}^1); \alpha_i] - E[\pi_i | a^1; \alpha_i] \right]^+, \; F_i(a^1; \alpha_i) \right\}. \tag{4}$$

Note that $j$ cannot be blamed if he is not active in the first stage. With this, $i$’s decision utility with anger from
blaming behavior (ABB) is

$$u_i^{ABB}(h, a_i; \alpha_i) = E[\pi_i | (h, a_i); \alpha_i] - \theta_i \sum_{j \neq i} B_{ij}(h; \alpha_i) \, E[\pi_j | (h, a_i); \alpha_i].$$

[Figure 2: game tree. Chance (c) selects a good day with probability 1 − ε, ending the game with payoffs (2, 2), or a bad day B with probability ε, after which Andy (a) chooses N, with payoffs (1, 2), or T, with payoffs (0, 0).]

Fig. 2. Hammering one’s thumb.

The essence of Example 1 can also be demonstrated with ABB instead of SA, but the next example exhibits a difference
between the two approaches:


























Example 2. (Inspired by Frijda, 1993) Consider Fig. 2. Andy the handyman ($a$) uses a hammer. His apprentice, Bob ($b$), is
inactive. On a bad day (determined by chance, $c$) Andy hammers his thumb and can then take it out on Bob ($T$), or not ($N$).
Assuming $\alpha_a(B) = \varepsilon < 1/2$, we have

$$F_a(B; \alpha_a) = (1 - \varepsilon) \cdot 2 + \varepsilon \, \alpha_a(N | B) \cdot 1 - 1 > 0.$$

With SA and with $\theta_a$ sufficiently high, on a bad day Andy chooses $T$ in a fit of displaced aggression. But, since Bob is
passive, with ABB Andy chooses $N$ regardless of $\theta_a$.
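The contrast between SA and ABB in Example 2 can be reproduced numerically. The Python sketch below is illustrative only: the function names, the string labels for the two models, and the parameter values are assumptions made for the example. Payoffs follow Fig. 2, with Andy's payoff listed first: (2, 2) on a good day, (1, 2) after N, and (0, 0) after T.

```python
def andy_frustration(eps, plan_N):
    """F_a(B; alpha_a): Andy's frustration after the bad day B."""
    initial = (1 - eps) * 2 + eps * plan_N * 1
    return max(initial - 1, 0.0)  # the best Andy can get after B is 1 (choosing N)

def andy_choice(theta_a, eps, plan_N, model):
    """Andy's preferred action after B under model 'SA' or 'ABB'."""
    F = andy_frustration(eps, plan_N)
    # SA: all frustration is directed at Bob. ABB: Bob is inactive in stage 1,
    # so no alternative action of his could have helped, and blame is zero.
    blame = F if model == "SA" else 0.0
    u_N = 1 - theta_a * blame * 2  # payoffs (1, 2): Bob gets 2
    u_T = 0 - theta_a * blame * 0  # payoffs (0, 0): Bob gets 0
    return "T" if u_T > u_N else "N"

# With eps = 1/4 and a plan to choose N, frustration is 0.75.
print(andy_choice(1.0, 0.25, 1.0, "SA"))     # high sensitivity: displaced aggression
print(andy_choice(0.5, 0.25, 1.0, "SA"))     # low sensitivity: no aggression
print(andy_choice(100.0, 0.25, 1.0, "ABB"))  # Bob is never blamed
```

In this parametrization Andy takes it out on Bob under SA exactly when twice the product of his sensitivity and his frustration exceeds 1, while under ABB the choice of N does not depend on the sensitivity parameter at all.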














Next, we consider a more nuanced notion of blame, where players are concerned with their co-players’ intentions, and 
preferences therefore depend upon second-order beliefs. 


Anger from blaming intentions (ABI) A player $i$ prone to anger from blaming intentions (ABI) asks himself, for each
$j \neq i$, whether $j$ intended to give $i$ a low expected payoff. Since such intention depends on $\alpha_j$ (which includes
$j$’s plan, $\alpha_{j,j}$), how much $i$ blames $j$ depends on $i$’s second-order beliefs $\beta_i$, and the decision utility
function has the form (1).

The maximum payoff that $j$, initially, can expect to give to $i$ is

$$\max_{a_j^1 \in A_j(\varnothing)} \sum_{a_{-j}^1 \in A_{-j}(\varnothing)} \alpha_{j,-j}(a_{-j}^1) \, E[\pi_i | (a_j^1, a_{-j}^1); \alpha_j] \geq \sum_{a^1 \in A(\varnothing)} \alpha_j(a^1) \, E[\pi_i | a^1; \alpha_j] = E[\pi_i; \alpha_j],$$

where the inequality holds by definition and the equality is implied by the chain rule. Note also that
$\alpha_j(\cdot | a^1)$ is kept fixed under the maximization; we focus on what $j$ initially believes he could achieve, taking
the view that at the root he cannot control $a_j^2$ but predicts his choice in stage 2. We assume that $i$’s blame of $j$ at
$a^1$ equals $i$’s expectation, given $\beta_i$ and conditional on $a^1$, of the difference between the maximum payoff that $j$
can expect to give to $i$ and what $j$ actually plans/expects to give to $i$, capped by $i$’s frustration:

$$B_{ij}(a^1; \beta_i) = \min\left\{ E\left[ \max_{a_j' \in A_j(\varnothing)} \sum_{a_{-j}' \in A_{-j}(\varnothing)} \alpha_{j,-j}(a_{-j}') \, E[\pi_i | (a_j', a_{-j}'); \alpha_j] - E[\pi_i; \alpha_j] \;\Big|\; a^1; \beta_i \right], \; F_i(a^1; \alpha_i) \right\}, \tag{5}$$

where $\alpha_i$ is derived from $\beta_i$. The expression is nonnegative as per the previously highlighted inequality. Now,
using (5), $i$’s decision utility after $h = a^1$ is

$$u_i^{ABI}(h, a_i; \beta_i) = E[\pi_i | (h, a_i); \alpha_i] - \theta_i \sum_{j \neq i} B_{ij}(h; \beta_i) \, E[\pi_j | (h, a_i); \alpha_i].$$





We are ready to illustrate a difference between ABI and each of SA and ABB: 


Example 3. Return to Fig. 1. The maximum Ann can expect to give Bob is 2, independently of $\alpha_a$. Suppose Bob, upon
observing $g$, is certain Ann planned to offer $g$ with probability $p < 1$: $\beta_b(\alpha_a(g) = p | g) = 1$, and that Ann
expected him to accept with probability $q$: $\beta_b(\alpha_a(y | g) = q | g) = 1$. Finally, suppose Bob initially expected to
get the fair offer ($\alpha_b(f) = 1$), so his frustration after $g$ is $F_b(g; \alpha_b) = 2 - 1 = 1$. We get

$$B_{ba}(g; \beta_b) = \min\{2 - [2(1 - p) + qp], \, 1\} = \min\{p(2 - q), \, 1\}.$$

If $p$ is low enough, or $q$ high enough, unlike the case with SA or ABB, Bob does not blame all frustration on Ann. She gets
some credit for her initial intention to choose $f$ with probability $1 - p > 0$, and the degree of that credit depends on $q$.
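The blame formula of Example 3 is easy to tabulate. The following Python sketch is an illustration under stated assumptions: the function names and the numerical values of p, q, and the sensitivity parameter are hypothetical. The rejection comparison reuses the payoffs of Fig. 1, so the gain from rejecting is three times the sensitivity parameter times the blame, minus 1; this expression follows from the general form (1) and is a derivation made here rather than a formula stated in the example.

```python
def abi_blame(p, q, frustration=1.0):
    """B_ba(g; beta_b) = min{2 - [2(1 - p) + q p], F_b(g; alpha_b)}."""
    max_ann_can_give = 2.0                    # by offering f for sure
    ann_expected_to_give = 2 * (1 - p) + q * p
    return min(max_ann_can_give - ann_expected_to_give, frustration)

def abi_gain_from_rejecting(theta_b, p, q):
    """u_b^ABI(g, n) - u_b^ABI(g, y) when Bob initially expected f for sure."""
    blame = abi_blame(p, q)
    u_reject = 0 - theta_b * blame * 0  # payoffs (0, 0)
    u_accept = 1 - theta_b * blame * 3  # payoffs (3, 1)
    return u_reject - u_accept

# Ann planned g with probability 1/2 and expected acceptance: partial blame.
print(abi_blame(0.5, 1.0), abi_gain_from_rejecting(0.5, 0.5, 1.0))
# Ann planned g with probability 3/4 and expected rejection: blame hits the cap.
print(abi_blame(0.75, 0.0), abi_gain_from_rejecting(0.5, 0.75, 0.0))
```

In the first case blame is p(2 − q) = 0.5, below Bob's frustration of 1, and Bob accepts at a sensitivity of 0.5 even though the same Bob would reject under SA or ABB. In the second case p(2 − q) = 1.5 exceeds the cap, so blame equals frustration and the SA prediction is restored.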














3. Equilibrium analysis of leader-follower (LF-)games 


We now develop and systematically explore predictions for the three models introduced in Section 2 (SA, ABB, ABI). Our
main focus will be on leader-follower games with perfect information, a class rich enough to highlight striking behavioral
patterns yet narrow enough to permit sharp results that games outside that class defy. There are two players, and only one
is active in each stage. In the first stage the leader (denoted by $\ell$) is active. In the second stage, the follower
(denoted by $f$) is active. Thus, $\ell$ does not move in stage 2, $f$ does not move in stage 1, and there is no third party.
Formally:


Definition 1. A game form is called a leader-follower (LF-)game if $H = \{\varnothing\} \cup A(\varnothing)$, $I = \{\ell, f\}$,
$I(\varnothing) = \{\ell\}$, and $I(a^1) = \{f\}$ or $I(a^1) = \emptyset$ for every $a^1 \in A(\varnothing)$.


Condition $H = \{\varnothing\} \cup A(\varnothing)$ here says that no action of the leader terminates the game. This simplifies
the exposition and is without loss of generality, because the follower’s set of feasible actions may be a singleton after
some $a^1$; in this case $I(a^1) = \emptyset$. For example, we interpret the Ultimatum Minigame of Fig. 1 as a game form where
the responder is forced to accept the fair offer. The (regular) ultimatum game, the chain store game (Selten, 1978), and the
“pure threats game” of Klein and O'Flaherty (1993, Fig. 2) are other examples of LF-games. Hammering-one’s-thumb (Fig. 2) is
not.


Sequential equilibrium (SE) While we depart from traditional game-theoretic analysis in using belief-dependent decision utility, our analysis is otherwise traditional. We adapt B&D’s sequential equilibrium (SE) concept.¹⁴ We consider a complete information framework where the rules of the game and players’ (psychological) preferences are common knowledge.¹⁵ We interpret an SE as a profile of strategies and beliefs representing a “commonly understood” way to play the game by rational (utility maximizing) agents.¹⁶ The SE concept gives equilibrium conditions for infinite hierarchies of conditional probability systems. In our particular application, utility functions only depend on first- or second-order beliefs, so we define SE for assessments comprising beliefs up to only the second order. Since, technically, first-order beliefs are features of second-order beliefs (see Section 2), we provide definitions that depend only on second-order beliefs, which give SEs for games where psychological utility functions depend only on first-order beliefs as a special case.

Fix a game form (as defined in Section 2, so LF-games are a special case) and decision utility functions $u_i(h, \cdot\,; \cdot) : A_i(h) \times \Delta_i^2 \to \mathbb{R}$ ($i \in I$, $h \in H$). This gives a psychological game in the sense of B&D (Section 6). An assessment is a profile of behavior strategies and beliefs $(\sigma_i, \beta_i)_{i \in I} \in \times_{i \in I}(\Sigma_i \times \Delta_i^2)$ such that $\Sigma_i = \times_{h \in H} \Delta(A_i(h))$ and $\sigma_i$ is the plan $\alpha_{i,i}$ entailed by second-order belief $\beta_i$:

$$\sigma_i(a_i \mid h) = \alpha_{i,i}(a_i \mid h) = \beta_i\big(Z(h, a_i) \times \Delta_{-i}^1 \mid h\big) \tag{6}$$

for all $i \in I$, $h \in H$, $a_i \in A_i(h)$. Eq. (6) implies that the behavior strategies contained in an assessment are implicitly determined by players’ beliefs about paths; therefore, they could be dispensed with. Yet, we follow B&D and make behavior strategies explicit in assessments to facilitate comparisons with the refinements literature.

14 B&D extend Kreps and Wilson’s (1982) classic notion of SE to psychological games; we consider the version (B&D, Section 6) for preferences with own-plan dependence and “local” psychological utility functions. A difference with B&D is that they assume it common knowledge that players behave as planned, whereas we separate plans from behavior and let the consistency of behavior with plan be a rationality condition.

15 This is partly a modeling choice made for reasons of analytical tractability, but we note that the approach is supported by recent intriguing experimental evidence reported by van Leeuwen et al. (2018). They find that (pre-play) “facial cues provide a credible signal of destructive behavior,” and that subjects are to a degree capable of recognizing such “angry buttons.” For an analysis of incomplete-information psychological games see Battigalli et al. (2019).

16 This is a choice of focus more than an endorsement of SE as a solution concept. SE requires that each player $i$ is certain and never changes his mind about the true beliefs and plans, hence intentions, of his co-players. We find this feature questionable. B&D (Sections 2, 5) and Battigalli et al. (2019) argue that, with belief-dependent preferences, alternatives to SE like rationalizability, forward induction, and self-confirming equilibrium are even more plausible than with standard preferences.





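Definition 2. Assessment $(\sigma_i, \beta_i)_{i \in I}$ is consistent if, for all $i \in I$, $h \in H$, $a = (a_j)_{j \in I} \in A(h)$,

(a) $\alpha_i(a \mid h) = \prod_{j \in I} \sigma_j(a_j \mid h)$,
(b) $\operatorname{marg}_{\Delta_{-i}^1} \beta_i(\cdot \mid h) = \delta_{\alpha_{-i}}$,

where $\alpha_j$ is derived from $\beta_j$ and $\delta_{\alpha_{-i}}$ is the Dirac measure assigning probability 1 to $\{\alpha_{-i}\} \subseteq \Delta_{-i}^1$.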


Condition (a) requires players’ beliefs about actions to satisfy independence across co-players (on top of own-action independence), and, conditional on each $h$, each $i$ expects each $j$ to behave in the continuation as specified by $j$’s plan $\sigma_j = \alpha_{j,j}$, even though $j$ previously deviated from $\alpha_{j,j}$. All players thus have the same first-order beliefs. Condition (b) requires that players’ beliefs about co-players’ first-order beliefs (hence their plans) are correct and never change, on or off the path. Thus all players, essentially, have the same second-order beliefs (considering that they are introspective and so know their own first-order beliefs). These conditions faithfully capture the “trembling-hand” interpretation of deviations implicit in Kreps and Wilson’s (1982) original definition of SE: if $i$ observed deviations from the plans $\alpha_{-i}$ he ascribes to his co-players (according to $\beta_i$), instead of changing his mind about the co-players’ plans, he would conclude they made mistakes carrying out their plans. Such mistakes are independent across nodes and players, hence the probability of further deviations is negligible.


Definition 3. Assessment $(\sigma_i, \beta_i)_{i \in I}$ is a sequential equilibrium (SE) if it is consistent and satisfies the following sequential rationality condition: for all $h \in H$ and $i \in I(h)$, $\operatorname{Supp} \sigma_i(\cdot \mid h) \subseteq \arg\max_{a_i \in A_i(h)} u_i(h, a_i; \beta_i)$.


It can be checked that this definition is equivalent to the traditional one if players have standard preferences, i.e., with a profile of utility functions $(v_i : Z \to \mathbb{R})_{i \in I}$ such that $u_i(h, a_i; \beta_i) = E[v_i \mid (h, a_i); \alpha_i]$.¹⁷ A special case is the material-payoff game, where $v_i = \pi_i$ for each $i \in I$.


Remark 3. Every material-payoff game with perfect information and no relevant ties has a unique SE, which is in pure strategies 
and can be computed by backward induction. 
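Remark 3 can be illustrated with a short backward-induction routine. This is only a sketch under stated assumptions: the tree encoding (a terminal node is a payoff tuple, a decision node is a pair of mover index and a dictionary of branches) and the function name are hypothetical, and the payoffs of the ultimatum minigame are those used in the examples of the text.

```python
def backward_induction(tree):
    """Return (payoffs, path) for a finite perfect-information tree.

    Assumes no relevant ties, so the strict comparison below picks the
    unique material-payoff maximizing action at every decision node.
    """
    if not isinstance(tree[1], dict):          # terminal node: a payoff tuple
        return tuple(tree), []
    mover, branches = tree
    best = None
    for action, subtree in branches.items():
        payoffs, path = backward_induction(subtree)
        if best is None or payoffs[mover] > best[0][mover]:
            best = (payoffs, [action] + path)
    return best

# Ultimatum minigame: player 0 is Ann, player 1 is Bob.
ultimatum_minigame = (0, {"f": (2, 2), "g": (1, {"y": (3, 1), "n": (0, 0)})})
```

Applied to `ultimatum_minigame`, the routine selects the greedy offer followed by acceptance, which is the material-payoff SE $(g, y)$ discussed in Example 4.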


As is known from previous work, with psychological utilities uniqueness of equilibrium with deterministic plans may fail 
even in game forms with perfect information and with no relevant ties.¹⁸ The examples in Section 3 illustrate this for the
case of anger-prone players. On the other hand, in our framework existence of equilibrium with (possibly) non-deterministic 
plans is guaranteed under mild conditions. 


Theorem 1. If $u_i(h, a_i; \cdot)$ is continuous for all $i \in I$, $h \in H$ and $a_i \in A_i(h)$, then there is at least one SE.


B&D prove a version of this result where first-order beliefs are modeled as belief systems over pure strategy profiles. Our 
setting adds personal histories, and here first-order beliefs are modeled as beliefs about paths. Given those modifications, 
the “trembling-hand” technique used in B&D’s Theorem 9 can be applied to establish the result. We omit the details.¹⁹

What we said so far about SE does not assume specific functional forms. From now on, we focus on $u^{SA}$, $u^{ABB}$, and $u^{ABI}$. Since frustration and blame are continuous in beliefs, decision utility is also continuous, and we obtain existence in all cases of interest:


Theorem 2. Every game with SA, ABB, or ABI has at least one SE. 


17 According to the standard definition of SE, sequential rationality is given by global maximization over (continuation) strategies at each $h \in H$. By the One-Shot-Deviation principle, in the standard case this is equivalent to “local” maximization over actions at each $h \in H$.

18 See Geanakoplos et al. and B&D. 

19 A similar technique is used in the proof of Proposition 2 (first part) in the appendix. 
Properties of SE Next, we state two general results that we will later apply to LF-games. First, in pure-strategy SE, players are never frustrated on the equilibrium path:

Proposition 1. Let $(\sigma_i, \beta_i)_{i \in I}$ be an SE assessment of a game with SA, ABB, or ABI; if a history $h \in H$ has probability 1 under profile $(\sigma_i)_{i \in I}$, then

$$F_i(h'; \alpha_i) = 0 \quad \text{and} \quad \operatorname{Supp} \sigma_i(\cdot \mid h') \subseteq \arg\max_{a'_i \in A_i(h')} E[\pi_i \mid (h', a'_i); \alpha_i]$$

for all $h' \preceq h$ and $i \in I$, where $\alpha_i$ is derived from $\beta_i$. Therefore, an SE strategy profile of a game with SA, ABB, or ABI with randomization (if any) only in the last stage is also a Nash equilibrium of the agent form of the corresponding material-payoff game.


To illustrate in an LF-game, in Fig. 1, $(f, n)$ can be an SE strategy pair under ABB, and is a Nash equilibrium of the agent form with material-payoff utilities.²⁰ With (counterfactual) anger, $n$ becomes a credible threat. Note also that Proposition 1
implies that anger is behaviorally irrelevant in one-stage, simultaneous-move games: in our approach, frustration must be 
triggered by another player’s previous action if it is to influence behavior. 

Second, recall that two assessments are realization-equivalent if the corresponding strategy profiles yield the same 
probability distribution over terminal histories. The next result demonstrates that the material-payoff SE, with concordant 
beliefs, is also an equilibrium with belief-dependent anger. We have: 


Proposition 2. In every perfect-information (two-stage) game form with no chance moves and no relevant ties, the unique material- 
payoff equilibrium is realization-equivalent to an SE of the psychological game with ABI, ABB, or—with only two players—SA. 


Material-payoff SEs of perfect-information games (including LF-games) with no relevant ties are unique and must be in 
pure strategies (see Remark 3). By Proposition 1, players must maximize their material payoff on the path even if they are 
prone to anger. As for off-equilibrium path decision nodes, deviations from the material-payoff SE strategies can only be 
due to the desire to hurt the first-mover, which can only increase his incentive to stick to the material-payoff SE action. 

Let us next illustrate how the SE concept works under SA, ABB, and ABI in our favorite LF-example, the ultimatum 
minigame: 


Example 4. Consider Fig. 1. By Proposition 2, every utility function discussed admits the material-payoff SE $(g, y)$ as an SE, regardless of anger sensitivity. To check this, just note that, if Bob expects $g$, he cannot be frustrated, so, when asked to play, he maximizes his material payoff. Under SA and ABB, $(f, n)$ qualifies as another SE if $\theta_b \ge 1/3$; following $g$, Bob would be frustrated and choose $n$, so Ann chooses $f$. Under ABI, $(f, n)$ cannot be an SE. To verify, assume it were, so $\alpha_a(f) = 1$. Since the SE concept does not allow for players revising beliefs about beliefs, we get $\beta_b(\alpha_a(f) = 1 \mid g) = 1$ and $B_{ba}(g; \beta_b) = 0$; Bob maintains his belief that Ann planned to choose $f$, hence she intended to maximize Bob’s payoff. Hence, Bob would choose $y$, contradicting that $(f, n)$ is an SE. Next, note that $(g, n)$ is not an SE under any concept: given SE beliefs, Bob would not be frustrated and hence would choose $y$. The only way to observe rejected offers with positive probability in an SE is with non-deterministic plans. To find such an SE, note that we need $\alpha_a(g) \in (0, 1)$; if $\alpha_a(g) = 0$ Bob would not be reached, and if $\alpha_a(g) = 1$ he would not be frustrated and hence would choose $y$. Since Ann uses a non-degenerate plan she must be indifferent, so $\alpha_b(y) = 2/3$, implying that Bob is indifferent too. In SE, Bob’s frustration at $g$ is

$$\left[2(1 - \alpha_a(g)) + \tfrac{2}{3}\alpha_a(g) - 1\right]^+ = \left[1 - \tfrac{4}{3}\alpha_a(g)\right]^+,$$

which equals his blame of Ann under SA and ABB. Hence we get the indifference condition

$$1 - \theta_b\left[1 - \tfrac{4}{3}\alpha_a(g)\right]^+ \cdot 3 = 0 - \theta_b\left[1 - \tfrac{4}{3}\alpha_a(g)\right]^+ \cdot 0 \iff \alpha_a(g) = \frac{3}{4} - \frac{1}{4\theta_b},$$

where $\theta_b > 1/3$. The higher is $\theta_b$, the more likely Bob is to get the low offer, so Bob’s initial expectations, and hence his frustration and blame, are kept low. Under ABI we get another indifference condition:

$$1 - \theta_b B_{ba}(g; \beta_b) \cdot 3 = 0 - \theta_b B_{ba}(g; \beta_b) \cdot 0 \iff 1 - \theta_b \min\left\{\tfrac{4}{3}\alpha_a(g),\, \left[1 - \tfrac{4}{3}\alpha_a(g)\right]^+\right\} \cdot 3 = 0.$$

The right term in braces is Bob’s frustration, while

$$\tfrac{4}{3}\alpha_a(g) = 2 - \left[2(1 - \alpha_a(g)) + \tfrac{2}{3}\alpha_a(g)\right]$$

is the difference between the maximum payoff Ann could plan for Bob and the actual one. Frustration is the lower of the two terms if $\alpha_a(g) \ge 3/8$; if, with that, we can solve the equation, we duplicate the SA/ABB-solution; this is doable if $\theta_b \ge 2/3$, because $\frac{3}{4} - \frac{1}{4\theta_b} \ge \frac{3}{8}$ exactly when $\theta_b \ge 2/3$. If $\theta_b \ge 2/3$, with ABI, there is a second non-degenerate equilibrium plan with $\alpha_a(g) \in (0, 3/8]$ where $\alpha_a(g) = 1/(4\theta_b)$; to see this, solve the ABI indifference condition assuming $\frac{4}{3}\alpha_a(g) \le 1 - \frac{4}{3}\alpha_a(g)$. This SE exhibits starkly different comparative statics: the higher is $\theta_b$, the less likely Bob is to get a low offer and the less he blames Ann following $g$ in light of her intention to choose $f$ with higher probability.

20 In the agent form of a game, each $h$ where player $i$ is active corresponds to a copy $(i, h)$ of $i$ with strategy set $A_i(h)$ and the same utility function as $i$.
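The indifference conditions of Example 4 can be verified with exact arithmetic. The sketch below is an illustration under stated assumptions: it takes Bob's acceptance probability to be 2/3 (Ann's indifference), uses the ultimatum minigame payoffs of the text, and the function names are hypothetical. It simply tests the two candidate plans against Bob's indifference condition for each blame notion.

```python
from fractions import Fraction as Fr

def frustration(a_g):
    """Bob's frustration after g when Ann plans g w.p. a_g and Bob accepts w.p. 2/3."""
    return max(Fr(0), 1 - Fr(4, 3) * a_g)

def blame(a_g, model):
    """Bob's blame of Ann after g under 'SA', 'ABB', or 'ABI'."""
    if model in ("SA", "ABB"):
        return frustration(a_g)
    # ABI: blame is capped by what Ann's plan withheld from Bob, (4/3) * a_g.
    return min(Fr(4, 3) * a_g, frustration(a_g))

def mixed_se_plans(theta_b, model):
    """Interior plans alpha_a(g) at which Bob is indifferent between y and n.

    theta_b must be positive. Candidates are the two closed-form solutions
    discussed in the text; each is kept only if it satisfies the condition
    1 - 3 * theta_b * blame = 0 exactly.
    """
    theta_b = Fr(theta_b)
    candidates = {Fr(3, 4) - 1 / (4 * theta_b), 1 / (4 * theta_b)}
    return sorted(a for a in candidates
                  if 0 < a < 1 and 1 - 3 * theta_b * blame(a, model) == 0)
```

With $\theta_b = 1$, for example, the routine returns the single plan $1/2$ under SA and the two plans $1/4$ and $1/2$ under ABI, while with $\theta_b = 1/2$ it returns $1/4$ under SA and no interior plan under ABI.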














In the example, the set of SE under ABI may differ from the set of SE under SA and ABB which, however, agree. This is 
no fluke. In LF-games, under ABB the follower’s frustration can only be blamed on the leader, so in these games SA and ABB 
are equivalent. Let us write $u_{i,\theta_i}$ to make the dependence of $u_i$ on $\theta_i$ explicit. We can thus note:


Remark 4. In LF-games, SA and ABB coincide, i.e., $u^{SA}_{i,\theta_i} = u^{ABB}_{i,\theta_i}$ for all $\theta_i$.


This property is not guaranteed for non-LF-games, as the following two examples show: 


Example 5. Consider the extension of the ultimatum minigame in Fig. 3. It adds a third player, Darryl ($d$, because we leave $c$ for chance), whose payoffs are represented by the third element in the payoff vectors. It also adds an alternative way for Bob to accept by choosing $y'$, which punishes Darryl by $x > 0$. If Ann chooses $g$, assuming $\alpha_b(g) = \varepsilon < 1/2$, we have

$$F_b(g; \alpha_b) = (1 - \varepsilon) \cdot 2 + \varepsilon\,(\alpha_b(y \mid g) + \alpha_b(y' \mid g)) \cdot 1 - 1 > 0.$$

With SA and $\theta_b > 0$, if $x > 3$, after $g$ Bob chooses $y'$ in another example of displaced aggression. But, since Darryl is passive and does not move in the first stage, with ABB Bob does not blame Darryl, and Bob is indifferent between $y$ and $y'$, regardless of $\theta_b$. Note that this implies that with SA Bob does not choose $n$ in any SE, while with ABB the pure-strategy SE $(f, n)$ remains.
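The contrast between SA and ABB in Example 5 can be made concrete by tabulating Bob's decision utilities after $g$. This is a sketch under stated assumptions: it takes the frustration level as a given input, uses the Fig. 3 payoffs as described in the example, and treats Ann as the only blamed co-player under ABB; the function name is hypothetical.

```python
def bob_utilities_after_g(theta_b, frustration, x, model):
    """Bob's decision utility from n, y, and y' after g in the bystander game."""
    # Material payoffs (Ann, Bob, Darryl) after each of Bob's replies to g.
    outcomes = {"n": (0, 0, 0), "y": (3, 1, 0), "y'": (3, 1, -x)}
    utilities = {}
    for action, (pay_a, pay_b, pay_d) in outcomes.items():
        if model == "SA":        # simple anger: every co-player is a target
            target = pay_a + pay_d
        elif model == "ABB":     # blame only Ann, the co-player who has moved
            target = pay_a
        else:
            raise ValueError("model must be 'SA' or 'ABB'")
        utilities[action] = pay_b - theta_b * frustration * target
    return utilities
```

With `theta_b = 1`, `frustration = 1`, and `x = 4`, SA makes `y'` the best reply (displaced aggression toward Darryl), whereas under ABB the two ways of accepting are equally attractive and rejection is preferred.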














LF-games (as in Definition 1) have a single follower, and Example 5 shows that the statement in Remark 4 does not 
extend to games with more followers. LF-games also do not allow for chance moves, and our next example shows that their 
presence may impede other attempts at extending Remark 4: 


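[Fig. 3. Ultimatum minigame with a bystander. Game tree: Ann ($a$) chooses $f$, ending at $(2, 2, 0)$, or $g$; after $g$, Bob ($b$) chooses $n$, ending at $(0, 0, 0)$, $y$, ending at $(3, 1, 0)$, or $y'$, ending at $(3, 1, -x)$.]

Example 6. Consider again the hammering-one’s-thumb game (Fig. 2). With $u^{ABB}$ (either version), or $u^{ABI}$, Andy will not blame Bob so his SE-choice is $N$. But with $u^{SA}$ Andy may choose $T$. Recall that $F_a(B; \alpha_a) = 2(1 - \varepsilon) + \varepsilon\alpha_a(N \mid B) - 1$, so the more likely Andy believes it to be that he will take it out on Bob, the less he expects initially and the less frustrated he is after $B$. Yet, in SE, the higher is $\theta_a$, the more likely Andy is to take it out on Bob: Andy’s utility from $N$ and $T$ is

$$u_a^{SA}(B, N; \alpha_a) = 1 - \theta_a[2(1 - \varepsilon) + \varepsilon\alpha_a(N \mid B) - 1] \cdot 2,$$
$$u_a^{SA}(B, T; \alpha_a) = 0 - \theta_a[2(1 - \varepsilon) + \varepsilon\alpha_a(N \mid B) - 1] \cdot 0 = 0.$$

Sequential rationality of SE implies that one possibility is $\alpha_a(N \mid B) = 1$ and $u_a^{SA}(B, N; \alpha_a) \ge u_a^{SA}(B, T; \alpha_a)$, implying $\theta_a \le \frac{1}{2(1 - \varepsilon)}$. Another possibility is $\alpha_a(N \mid B) = 0$ and $u_a^{SA}(B, N; \alpha_a) \le u_a^{SA}(B, T; \alpha_a)$, implying $\theta_a \ge \frac{1}{2(1 - 2\varepsilon)}$. I.e., if Andy is sufficiently susceptible to SA, on bad days he takes his frustration out on Bob. If $\theta_a \in \left(\frac{1}{2(1 - \varepsilon)}, \frac{1}{2(1 - 2\varepsilon)}\right)$, we can solve for the unique SE in which $u_a^{SA}(B, N; \alpha_a) = u_a^{SA}(B, T; \alpha_a)$ and

$$\alpha_a(N \mid B) = \frac{1}{2\varepsilon\theta_a} - \frac{1 - 2\varepsilon}{\varepsilon} \in (0, 1).$$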
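The three cases of Example 6 can be collected in one function. The following is a sketch, assuming $0 < \varepsilon < 1/2$ (so that both thresholds are positive and ordered) and the utilities stated in the example; the function name is hypothetical.

```python
def andy_plan_not_vent(theta_a, eps):
    """SE probability alpha_a(N|B) that Andy does not take it out on Bob under SA."""
    if not 0 < eps < 0.5:
        raise ValueError("this sketch assumes 0 < eps < 1/2")
    low = 1 / (2 * (1 - eps))         # at or below this sensitivity, N for sure
    high = 1 / (2 * (1 - 2 * eps))    # at or above this sensitivity, T for sure
    if theta_a <= low:
        return 1.0
    if theta_a >= high:
        return 0.0
    # Interior case: Andy is indifferent between N and T.
    return 1 / (2 * eps * theta_a) - (1 - 2 * eps) / eps
```

For `eps = 0.25` the thresholds are 2/3 and 1, and a sensitivity of 0.8 yields an interior plan of 0.5, at which Andy's utility from `N` is exactly zero.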




















For $\theta_a \in \left(\frac{1}{2(1 - \varepsilon)}, \frac{1}{2(1 - 2\varepsilon)}\right)$, Example 6 additionally shows how we cannot, in general, take for granted the existence of an SE with a deterministic path (a point relevant also for $u^{ABB}$ or $u^{ABI}$ in other games).²¹ In LF-games with no relevant ties that point is moot, however. To see this, note that such games are covered by Remark 3 and Proposition 2. These results imply that LF-games with no relevant ties have a unique material-payoff SE in pure strategies that is realization-equivalent to an SE with SA, ABB, or ABI.

Example 6 also illustrates how we cannot, in general, take for granted the existence of an SE in which no player is 
frustrated along the path of play. In LF-games with no relevant ties, also that point is moot, because in the SE with SA, ABB, 
or ABI realization-equivalent to the material-payoff SE the path is deterministic and the follower cannot be frustrated on 
path. 

In LF-games, our model captures an important aspect of the psychology of anger. Sell et al. (2009) argue that anger 
“is produced by a neurocognitive program engineered by natural selection to use bargaining tactics to resolve conflicts of 
interest in favor of the angry individual.” Our approach is consistent with this view. First note that our complete-information 
analysis assumes that the leader knows how anger-prone the follower is. This is a good approximation for interactions 
between agents who know each other well, and also a good approximation of face-to-face interactions, given that humans 
are good at reading facial cues to infer personality traits such as trait-anger (van Leeuwen et al., 2018). With this, our models 
predict that in LF-game forms anger-prone followers will obtain at least as large a material payoff as self-interested ones. In 
our evolutionary past, higher material payoffs meant longer survival and better access (for males) to sexual partners, both 
of which yield higher reproduction rates (e.g., Buss, 2016). Hence, we argue that our approach supports the claim of Sell et 
al. 


Proposition 3. In every LF-game with no relevant ties, the expected material payoff of the follower in any SE with SA, or ABB, or ABI is
at least as large as the material payoff of the follower in the unique material-payoff SE. 


To see why this is the case, consider an SE of the psychological game that yields a different expected material payoff 
to the follower than the material-payoff equilibrium. Since a first mover cannot be frustrated, all the actions chosen by the 
leader with positive probability in this SE must maximize his expected material payoff. Thus, if in this SE the leader deviates 
from the material-payoff equilibrium action, it must be the case that he correctly expects this action to be “punished” by 
the follower with a deviation from the follower’s material-payoff maximizing reply. This in turn requires that the follower is 
frustrated by the material-payoff equilibrium action of the leader, hence, that his expected payoff in this SE is higher than 
in the material-payoff equilibrium. 

In game forms with two or more followers, Proposition 3 need not hold. For example, there may exist equilibria where one follower behaves as-if self-interested, while the other is frustrated by the material-payoff equilibrium outcome. This can result in the first follower getting less than in the material-payoff equilibrium, as shown by the following example.


Example 7. Consider the game in Fig. 4. The leader, Ann, chooses between Left ($l$) and Right ($r$). In the left subgame Bob chooses between the payoff profile $(7, 1, 3)$ and 0 for all players. In the right subgame Darryl chooses between the payoff profile $(6, 2, 2)$ and 0 for all players. The material-payoff SE is $(l, y, y')$ and in this equilibrium Darryl gets 3. However, with SA, ABB, or ABI, for sufficiently large values of $\theta_b$ and for all $\theta_d \ge 0$, the strategy profile $(r, n, y')$ can also be an SE. To see this, note that if Ann deviates to $l$, Bob will be frustrated, and if $\theta_b$ is large enough Bob will choose $n$ after $l$. In this equilibrium Darryl gets 2, a payoff smaller than in the material-payoff equilibrium.














[Fig. 4. A game with two followers. Game tree: Ann ($a$) chooses $l$ or $r$; after $l$, Bob ($b$) chooses between $(0, 0, 0)$ and $(7, 1, 3)$; after $r$, Darryl ($d$) chooses between $(0, 0, 0)$ and $(6, 2, 2)$.]

21 In the example it even happens with a single active player, highlighting that we deal with a psychological game, as this could never be the case in a standard game.

Next, we demonstrate that our models of anger are behaviorally relevant only in the subclass of LF-games involving threats. As above, consider an LF-game with no relevant ties. In these, the leader has a unique best response to each pure strategy $s_f \in \times_{a_\ell \in A_\ell(\varnothing)} A_f(a_\ell)$ of the follower. We let

$$r_\ell(s_f) = \arg\max_{a_\ell \in A_\ell(\varnothing)} \pi_\ell(a_\ell, s_f(a_\ell))$$

denote this best response. Let $(\bar{a}_\ell, \bar{s}_f) = (r_\ell(\bar{s}_f), \bar{s}_f)$ denote the unique material-payoff SE, where $\bar{s}_f$ is the material equilibrium pure strategy of the follower. With this, we define a threat as a strategy that penalizes the leader for playing the material-payoff SE:


Definition 4. In any LF-game with no relevant ties, a threat of player $f$ is a pure strategy $\hat{s}_f$ such that, relative to the material-payoff SE:


1. Implementing the threat harms the leader: $\pi_\ell(\bar{a}_\ell, \hat{s}_f(\bar{a}_\ell)) < \pi_\ell(\bar{a}_\ell, \bar{s}_f(\bar{a}_\ell))$.
2. The leader’s best response to the threat strategy benefits the follower (who subsequently maximizes his material payoff): let $\hat{a}_\ell = r_\ell(\hat{s}_f)$, then $\pi_f(\hat{a}_\ell, \hat{s}_f(\hat{a}_\ell)) = \pi_f(\hat{a}_\ell, \bar{s}_f(\hat{a}_\ell)) > \pi_f(\bar{a}_\ell, \bar{s}_f(\bar{a}_\ell))$.


Our definition of a threat is inspired by, but differs from, that of Klein and O'Flaherty (1993). Note that the threat must be costly to implement along the path of the material-payoff SE, because, by Condition 1, it differs from the unique material best response. Thus, the above conditions incorporate the notion of a threat that would not be credible if the follower were known to be a material-payoff maximizer. The paradigmatic example of a game form with threats is the Ultimatum Game: indeed, the strategy $n$ of rejecting the greedy offer in the Ultimatum Minigame of Fig. 1 is a threat. If the leader best responds to the threat, the resulting strategy pair is a Nash equilibrium of the material-payoff game. This is a general property of threats according to Definition 4: indeed, $\pi_f(\hat{a}_\ell, \hat{s}_f(\hat{a}_\ell)) = \max_{a_f \in A_f(\hat{a}_\ell)} \pi_f(\hat{a}_\ell, a_f)$ because $\bar{s}_f$ is the material-payoff backward induction strategy, and every strategy pair $(\hat{a}_\ell, \hat{s}_f)$ such that $\hat{a}_\ell = r_\ell(\hat{s}_f)$ and $\pi_f(\hat{a}_\ell, \hat{s}_f(\hat{a}_\ell)) = \max_{a_f \in A_f(\hat{a}_\ell)} \pi_f(\hat{a}_\ell, a_f)$ is a material-payoff Nash equilibrium. Furthermore, we note that we made Condition 1 in the definition explicit for conceptual clarity, although it is implied by Condition 2 and the assumption of no relevant ties. To see this, note that the inequality in Condition 2 implies that $\hat{a}_\ell \ne \bar{a}_\ell$. By no relevant ties, $\bar{a}_\ell$ (respectively, $\hat{a}_\ell$) is the unique best reply to $\bar{s}_f$ (respectively, $\hat{s}_f$), and equality $\pi_f(\hat{a}_\ell, \hat{s}_f(\hat{a}_\ell)) = \pi_f(\hat{a}_\ell, \bar{s}_f(\hat{a}_\ell))$ in Condition 2 implies $\hat{s}_f(\hat{a}_\ell) = \bar{s}_f(\hat{a}_\ell)$. Therefore,

$$\pi_\ell(\bar{a}_\ell, \hat{s}_f(\bar{a}_\ell)) < \pi_\ell(\hat{a}_\ell, \hat{s}_f(\hat{a}_\ell)) = \pi_\ell(\hat{a}_\ell, \bar{s}_f(\hat{a}_\ell)) < \pi_\ell(\bar{a}_\ell, \bar{s}_f(\bar{a}_\ell)).$$


To summarize: 


Remark 5. In every LF-game with no relevant ties, for every pure strategy $\hat{s}_f$ of the follower that satisfies Condition 2 of Definition 4, (i) the strategy pair $(r_\ell(\hat{s}_f), \hat{s}_f)$ is a Nash equilibrium of the material-payoff game, and (ii) $\hat{s}_f$ is a threat.
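Definition 4 lends itself to a brute-force check in small games. The following sketch enumerates the follower's pure strategies in a finite LF-game and keeps those satisfying Conditions 1 and 2. The nested-dictionary encoding of the game, the function name, and the label `accept_f` for the forced acceptance of the fair offer are hypothetical choices made for illustration; no relevant ties are assumed, so the `max` calls pick unique maximizers.

```python
from itertools import product

def find_threats(game):
    """game[a_l][a_f] = (leader payoff, follower payoff); returns the list of threats."""
    leader_actions = list(game)

    def best_reply(s_f):
        # r_l(s_f): the leader's material best response to follower strategy s_f.
        return max(leader_actions, key=lambda a: game[a][s_f[a]][0])

    # Material-payoff backward induction strategy of the follower and SE action.
    s_bar = {a: max(game[a], key=lambda af: game[a][af][1]) for a in leader_actions}
    a_bar = best_reply(s_bar)

    threats = []
    for choice in product(*(game[a] for a in leader_actions)):
        s_hat = dict(zip(leader_actions, choice))
        a_hat = best_reply(s_hat)
        # Condition 1: carrying out the threat after a_bar harms the leader.
        cond1 = game[a_bar][s_hat[a_bar]][0] < game[a_bar][s_bar[a_bar]][0]
        # Condition 2: the leader's best reply to the threat benefits the follower.
        cond2 = (game[a_hat][s_hat[a_hat]][1] == game[a_hat][s_bar[a_hat]][1]
                 > game[a_bar][s_bar[a_bar]][1])
        if cond1 and cond2:
            threats.append(s_hat)
    return threats

# Ultimatum minigame as an LF-game: the responder is forced to accept the fair offer.
minigame_lf = {"f": {"accept_f": (2, 2)}, "g": {"y": (3, 1), "n": (0, 0)}}
```

In the ultimatum minigame the only strategy found is the one that rejects the greedy offer, in line with the discussion following Definition 4.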


We can use Definition 4 to characterize the SE behavior of anger-prone followers: 


Proposition 4. In every LF-game with no relevant ties where the follower has a threat, there exists a pure-strategy SE (the “deterrence 
SE”) with SA/ABB (when the anger sensitivity parameter is sufficiently large) such that 


1. The follower’s strategy is a threat. 
2. The leader does not play his material-payoff SE action. 
3. The follower’s material payoff is strictly greater than in the material-payoff SE. 


The proof of Proposition 4 involves demonstrating that the consistent assessment where the follower plays a given threat 
(and the leader best responds to the threat) is sequentially rational when the follower is sufficiently prone to anger, and 
therefore it constitutes an SE with SA and ABB. This construction does not work with ABI for the reasons explained in 
Example 4 about the Ultimatum Game: under the “trembling-hand” interpretation of deviations inherent in the SE concept, 
a deviation from the action $\hat{a}_\ell$ that the follower wants to induce is interpreted by him as unintentional, so that he does not
blame the leader and the threat is not credible. 
We also provide a comparative statics result that relates anger sensitivity to material payoffs. Proposition 4 demonstrates that the deterrence SE in LF-games with threats exists when the anger sensitivity parameter of the follower $\theta_f$ is sufficiently large. The proposition below relates the minimal such sensitivity parameter, $\underline{\theta}_f$, to material payoffs:


Proposition 5. In every LF-game with no relevant ties where the follower has a threat, the minimal value of the anger sensitivity of the follower $\underline{\theta}_f$ that supports the deterrence SE is


1. Increasing in the follower’s material payoff from the material-payoff SE and decreasing in the follower’s material payoff from the deterrence SE.
2. Decreasing in the leader’s material payoff from the material-payoff SE.


In the deterrence SE the follower’s frustration after the leader deviates to the material payoff strategy is the difference 
between his material payoffs from the deterrence SE and the material-payoff SE. Greater frustration implies that a lower 
sensitivity to anger is sufficient to support the deterrence SE. In addition, the follower’s anger (which determines his decision utility) is increasing in the leader’s payoff from deviating to the material-payoff SE, so increasing this amount reduces the value of $\underline{\theta}_f$ needed to support the deterrence SE.
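The comparative statics above can be traced to a closed form for the minimal sensitivity. The sketch below is a derivation under an explicit assumption, not a formula stated in the text: it takes the follower's SA/ABB decision utility in an LF-game to be his material payoff minus $\theta_f$ times frustration times the leader's material payoff (the form used in the indifference condition of Example 4), and it requires the threat action to be weakly preferred to the material best reply after the leader's material-payoff SE action. The function and parameter names are hypothetical.

```python
from fractions import Fraction as Fr

def min_theta_deterrence(pi_f_mat, pi_f_det, pi_f_threat, pi_l_mat, pi_l_threat):
    """Smallest theta_f supporting the deterrence SE under the assumed utility form.

    pi_f_mat, pi_l_mat: follower and leader payoffs in the material-payoff SE.
    pi_f_det: follower payoff in the deterrence SE.
    pi_f_threat, pi_l_threat: payoffs when the threat is carried out after the
    leader's material-payoff SE action.
    """
    frustration = Fr(pi_f_det) - Fr(pi_f_mat)        # expected minus best attainable
    cost_to_follower = Fr(pi_f_mat) - Fr(pi_f_threat)
    harm_to_leader = Fr(pi_l_mat) - Fr(pi_l_threat)
    return cost_to_follower / (frustration * harm_to_leader)
```

For the ultimatum minigame this gives `min_theta_deterrence(1, 2, 0, 3, 0) == Fr(1, 3)`, the threshold found in Example 4. Raising the follower's material-SE payoff raises the threshold, while raising his deterrence-SE payoff or the leader's material-SE payoff lowers it, matching the two parts of Proposition 5.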

The next proposition shows that, if the game form does not include a threat, then the follower behaves as-if self- 
interested in every pure SE: 


Proposition 6. In every LF-game with no relevant ties and without threats, every pure SE of the psychological game with SA/ABB or
ABI is realization equivalent to the material-payoff SE. 


To illustrate, anger is not relevant in the trust game of Berg et al. (1995) (and the mini-trust game in B&D’s Fig. 1). 
In general, behavioral patterns that require players to place positive weight on a co-player’s material payoff cannot be 
explained via anger. 


4. Comparison with other models 


We now contrast the behavioral predictions resulting from frustration and anger with other models of strategic and 
social behavior. We first consider distributional preferences, which transform material payoffs at terminal histories but 
which otherwise retain the standard assumption that choices depend solely on their consequences in terms of material 
payoffs. We then discuss models of reciprocity, and some alternative approaches to modeling anger. 


Distributional preferences Like anger, models of distributional preferences such as inequality aversion (e.g., Fehr and Schmidt, 1999; Bolton and Ockenfels, 2000) predict costly punishment. However, a number of studies demonstrate that the decision to engage in costly punishment depends on both payoffs reached at terminal histories and payoffs from unreached histories.²² Our approach can capture this non-consequentialist aspect of behavior, while distributional preferences cannot.
For example, in the ultimatum minigame of Fig. 1, Bob’s decision to reject the greedy offer may depend upon not only the 
payoffs from accepting or rejecting the offer, but also upon the payoff that Ann could have given Bob had she chosen the 
fair offer. Holding payoffs and other beliefs constant, our models predict that Bob will be more likely to reject the greedy 
offer when either he assigns higher probability to receiving the fair one, or when the fair payoff is increased. In general, 
distributional preferences cannot capture behavioral patterns which depart from consequentialism, while our models can. 

Our models assume that players care only about material payoffs and anger, disregarding distributional considerations. 
We thereby highlight the effects of frustration, blame, and anger on behavior in strategic interaction. However, real-world 
decision makers may have a mixture of material, distributional, and psychological motivations (e.g., Falk and Fischbacher, 
2006). We leave the exploration of mixed-motive concerns for the future. 


Reciprocity Negative reciprocity à la Rabin (1993), Dufwenberg and Kirchsteiger (2004), and Falk and Fischbacher (2006)
joins anger as a motivation that triggers hostility. Like anger, negative reciprocity can result in non-consequentialist behavior, 
but anger and negative reciprocity differ in key ways. We sketch a comparison with Dufwenberg and Kirchsteiger’s notion 
of sequential reciprocity equilibrium (SRE; refer to their article for full definitions). 

In the hammering-one’s-thumb game (Fig. 2), Andy may take it out on Bob if he is motivated by simple anger. Were he 
motivated by reciprocity, this could never happen: Bob’s kindness, since he is a dummy-player, equals 0, implying that Andy 
chooses as-if selfish. Reciprocity here reflects intuitions similar to the ABI concept, but that analogy only carries so far, as 
we show next. 

Reciprocity also allows for “miserable equilibria,” where a player reciprocates expected unkindness before it occurs. For example, in the Ultimatum Minigame of Fig. 1, $(g, n)$ may be an SRE. Ann makes offer $g$ despite believing that Bob will reject; given her beliefs about Bob’s beliefs, Ann perceives Bob as seeing this coming, which makes him unkind, so she punishes by choosing $g$. Such self-fulfilling prophecies of destructive behavior have no counterpart under any of our anger notions. Since Ann moves at the root, Remarks 1 and 2 demonstrate that she cannot be frustrated, and hence chooses as-if selfish. By Proposition 1, sacrificing material payoff to harm a co-player never occurs on the path of a pure-strategy SE with ABB, ABI, or (in two-player games) SA.

22 Brandts and Sola (2001), Charness and Rabin (2002), Nelson (2002), Falk et al. (2003, 2008), Sutter (2007).


Alia Card and Dahl (2011) show that reports of domestic abuse go up when football home teams favored to win lose. They argue that this is in line with the theory of expectations-dependent reference points developed by Kőszegi and Rabin (2006, 2007; henceforth K&R). K&R model the loss felt when a player gets less than he expected, which one may think of as a form of disappointment with negative valence (cf. Bell, 1985, Loomes and Sugden, 1986). However, K&R do not model other-regarding preferences directly: they focus on the consequences of their model for individual decisions. Our models study the social consequences of frustration: frustration results in lower weights on co-player payoffs, and hence encourages costly punishment. Our simple anger model and the example of hammering-one’s-thumb encapsulate Card and Dahl’s result. A key difference between this paper and K&R is that in their work anticipation of the negative valence of
future frustrations influences decision utility. Our decision makers are influenced by past frustrations, rather than future 
ones. Important modeling details then distinguish how we define frustration and how K&R define loss (e.g., how we cap 
frustration using the highest attainable payoff). 

In a series of intriguing papers, Rotemberg explores how consumer anger shapes firms’ pricing (2005, 2011), as well 
as interaction in ultimatum games (2008). He proposes (versions of) a theory in which players are slightly altruistic, and 
consumers/responders also care about their co-players’ degrees of altruism. Namely, they abruptly become angry and punish 
a co-player whom they come to believe has an altruism parameter lower than some threshold. “One can thus think of 
individual i as acting as a classical statistician who has a null hypothesis that people’s altruism parameter is at least as 
large as some cutoff value. If a person acts so that i is able to reject this hypothesis, individual i gains ill-will towards this 
person” (Rotemberg, 2008, p. 464). Rotemberg shows how his model impressively captures the action in his data sets. It 
is natural to wonder whether our approach, which is structured very differently from his (e.g., we make no reference to 
altruism), could achieve that too. As regards behavior in ultimatum (and some other) games, there is already some existing 
evidence that is consistent with our approach; see the discussion regarding experiments below. Regarding pricing, we leave 
for empirical economists the task of exploring the topic. 

Winter (2014) and Winter et al. (2016) model anger and other emotions in games with a version of the indirect
evolutionary approach:²³ Like us, Winter et al. assume that preferences over outcomes are “emotional” and endogenous, but
we differ in the way we model emotions and make them endogenous. We assume that emotions depend on endogenous 
beliefs, while Winter et al. model the rest points of an adaptation process of belief-independent preferences. 

Brams (2011) studies anger in sequential interactions by modeling players who take turns changing the state of a 2 × 2
payoff matrix and receive payoffs at the end of the game. However, like Winter’s, his model of anger is independent of 
beliefs, while we argue that beliefs are central to emotions. 

Akerlof (2016) models anger in a two-person Bayesian game where the first mover decides whether or not to follow 
a rule, and the second mover decides whether or not to punish the first mover. Player 1’s compliance with the rule is 
motivated by a sense of duty. Player 2 may be sensitive to anger from noncompliance if she thinks that a “reasonable 
person,” modeled as a person with similar preferences to player 2, would comply. Similarly to Rotemberg, Akerlof motivates 
costly punishment via preferences over others’ types. In contrast, we assume that anger and aggression arise from frustration 
and payoff expectations. Rotemberg’s and Akerlof’s approaches begin with norms about behavior, and condition anger on
violation of those. In contrast, we develop models that reflect the psychology of frustration and anger. 


5. Discussion 


Incorporating the effects of emotions in economic analysis is a balancing act. One wants to focus on sentiments that 
make empirical sense, but human psychology is multi-faceted and there is no unambiguous yardstick. Our formulation 
provides a starting point for exploring how anger shapes interaction, and experimental or other evidence will help to assess 
empirical relevance and suggest revised formulas. We conclude by discussing sundry topics that may help gain perspective 
on, build on, or further develop our work. 


Frustration Consider substituting \(E[\pi_i; \alpha_i] - E[\pi_i \mid a^1; \alpha_i]\) for \(F_i(a^1; \alpha_i)\) of Section 2. This would measure i’s actual
diminished expectations at \(a^1\), unlike \(F_i(a^1; \alpha_i)\), which reflects diminished expectations relative to what i believes is the most he
can get (which we think of as the adequate way to represent the unexpected unavailability of something cared about). To
appreciate how dramatically this would impact behavior, consider a two-player common-interest game: Ann chooses Out or
In; in the former case the game ends with payoffs (1,1), in the latter case Bob chooses between (0,0) and (2,2). Mutatis
mutandis, for high enough \(\theta_b\), with the alternative, under SA and ABB, there is an SE where Ann chooses Out and Bob would
go for (0,0). Following In, Bob would be frustrated because he (so-to-say) feels locked-in with his stage-2 planned action.
Our formulation of \(F_i(a^1; \alpha_i)\) rules that out.


23 See, for example, Güth and Kliemt (1998).
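
To make the comparison concrete, the following Python sketch computes Bob’s frustration after In under both formulations, together with his simple-anger reply. It is an illustration under stated assumptions only: the candidate assessment has Ann choosing Out and Bob planning (0,0), so that Bob initially expects 1; decision utility takes the simple-anger form (own payoff minus θ times frustration times the co-player’s payoff) that appears in Eq. (17) of the Appendix; and the function names and the value θ_b = 1.5 are hypothetical.

```python
# Illustrative sketch; names and the value of theta_b are assumptions, not part of the model.
PAYOFFS_AFTER_IN = {"(0,0)": (0.0, 0.0), "(2,2)": (2.0, 2.0)}  # Bob's action -> (Ann, Bob)
BOB_INITIAL_EXPECTATION = 1.0   # Ann is expected to choose Out, which pays (1,1)
BOB_PLAN_AFTER_IN = "(0,0)"     # Bob's planned stage-2 action in the candidate SE


def frustration_capped(initial_expectation, payoffs):
    """Frustration as in Section 2: expectation minus the most Bob can still get."""
    best_attainable = max(bob for _, bob in payoffs.values())
    return max(initial_expectation - best_attainable, 0.0)


def frustration_alternative(initial_expectation, payoffs, plan):
    """Alternative measure: expectation minus what Bob expects given his own plan."""
    return initial_expectation - payoffs[plan][1]


def simple_anger_best_reply(payoffs, theta, frustration):
    """Maximize Bob's payoff minus theta * frustration * Ann's payoff."""
    return max(payoffs, key=lambda a: payoffs[a][1] - theta * frustration * payoffs[a][0])


theta_b = 1.5
f_capped = frustration_capped(BOB_INITIAL_EXPECTATION, PAYOFFS_AFTER_IN)
f_alt = frustration_alternative(BOB_INITIAL_EXPECTATION, PAYOFFS_AFTER_IN, BOB_PLAN_AFTER_IN)
reply_capped = simple_anger_best_reply(PAYOFFS_AFTER_IN, theta_b, f_capped)
reply_alt = simple_anger_best_reply(PAYOFFS_AFTER_IN, theta_b, f_alt)
print(f_capped, reply_capped)
print(f_alt, reply_alt)
```

With these numbers the capped measure gives zero frustration and the reply (2,2), whereas the alternative gives frustration 1 and, for any θ_b above 1, the reply (0,0).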

Take a binary gamble where with probability p > 0 Ann wins $x > 0, and otherwise gets $0. Her frustration, using
our definition, equals her initial expectation: \(p \cdot x\). This embodies strong implications for how frustrations compare across
contexts, e.g., the frustration of a highly expected failure to win the state lottery versus that of some unlikely small loss. We
are agnostic as regards empirical relevance, but alert the reader to the issue.²⁴
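
As a hedged illustration with made-up numbers (assumptions, not data), suppose Ann has no further action after the gamble resolves, so that the most she can get after losing is 0. The definition of Section 2 then gives her frustration upon losing as follows, where the two parameter pairs are chosen only to contrast a lottery with a small, nearly certain prize:

```latex
\[
F = \bigl[\, p \cdot x - 0 \,\bigr]^{+} = p \cdot x ,
\]
\[
F_{\text{lottery}} = 10^{-7} \cdot 10^{7} = 1 ,
\qquad
F_{\text{small prize}} = 0.99 \cdot 1 = 0.99 .
\]
```

Under these assumed numbers, the almost certain failure to win the lottery and the unlikely loss of a one-dollar prize generate nearly the same frustration.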

The evidence says a player becomes frustrated when his goals are unexpectedly thwarted (see Section 1). Here we focus 
on material rewards, and Cases 1-3 indicate broad applied potential. Yet one may also imagine other sources of frustration: 


Case 4: In 2007 Apple launched its iPhone at $499. Soon after, it introduced a new version at $399 and re-priced the old
model at $299, causing outrage among early adopters. Apple paid back the difference. Did this help long run profit?
Case 5: The 2008 TARP bank bail-out infuriated some US voters. Did this ignite Tea Party/Occupy-Wall Street movements? 


In case 4, an early adopter is frustrated because he regrets he already bought, not because new information implies his 
expected rewards drop. In case 5, even an activist who is materially unaffected personally may be frustrated because of 
unexpected perceived unfairness. These examples are not exhaustive; further sources of frustration may, e.g., involve shocks 
to self-esteem.²⁵ Techniques analogous to those we have developed may be applicable in these cases, but going in these
directions is left for future research. 

As regards the effects of frustration, we considered changes to a player’s utility but neglected other plausible adjustments. 
Gneezy and Imas report data from an intriguing experiment involving two-player zero-sum material payoff games. In one 
game players gain if they are strong, in another if they are smart. Before play starts, one subject may anger his opponent 
and force him to stay in the lab to do boring tasks. A thus frustrated player’s performance is enhanced when strength 
is beneficial (possibly from increased adrenaline flow), but reduced when cool logic is needed (as if an angered player 
becomes cognitively impaired). Our approach can capture the first aspect, but not the second: We can let consequences of 
actions depend also on beliefs (e.g., because emotions affect strength or speed; cf. Rauh and Seccia, 2006); this ultimately 
translates into belief-dependent utility (or cost) of actions. However, to model the second effect, we would need a theory of 
endogenous cognitive abilities. 


Valence & action-tendency Psychologists classify emotions in multiple ways. Two prominent aspects are valence, the
intrinsic pleasantness or aversiveness of an emotion, and action-tendency, or how behavior is shaped as the emotion occurs. Both
notions have bearing on anger. For example, most psychologists believe anger has negative valence (see, e.g., Harmon-Jones 
and Sigelman, 2001, p. 978). Perhaps such considerations steer people to avoid frustrations, say by not investing in
stocks. That said, the distinguishing feature of anger that psychologists stress concerns its action-tendency of aggression, not
its valence. In our theory, we exaggerate this, abstracting away from frustration avoidance, while emphasizing frustration- 
induced aggression. This is reflected in the decision utility functions, which are shaped by current frustration, not by the 
anticipation of the negative valence of future frustrations.²⁶


Blame We explored various ways a player may blame others, but other notions are conceivable. For example, with anger 
from blaming behavior, i’s blame of j depends on what i believes he would truly get at counterfactual histories, rather than
the most he could get there. We view this modeling choice as reflecting local agency; i’s current agent views other agents 
of i as uncontrollable, and he has no direct care for their frustrations. Another example relates to how we model anger 
from blaming intentions: i’s blame of j depends on \(\beta_i\), his second-order beliefs. Recall that the interpretation concerns
beliefs about beliefs about material payoffs, not beliefs about beliefs about frustration, which would be third- rather than 
second-order beliefs. Battigalli and Dufwenberg (2007), in a context which concerned guilt rather than anger, worked with 
such a notion. 

Our blame concepts one way or another assess the marginal impact of others. For example, consider a game where i 
exits a building while all \(j \in I \setminus \{i\}\), unexpectedly to i, simultaneously hurl buckets of water at i, who gets soaked. According
to our approach, i cannot blame any j as long as there are at least two hurlers. One could imagine that i alternatively 
blames, say, all the hurlers on the grounds that they collectively could thwart i’s misery, or that i splits the blame among 
all hurlers. Halpern (2016, Chapter 6) explores such issues and develops a model that assigns positive blame even when 
outcomes are overdetermined. 

People may also blame others in unfair ways, e.g., nominating scapegoats. Our notions of SA and ABB may embody such 
notions to some degree. However, it has not been our intention to address this issue systematically. 


24 The example involves just one player, to facilitate frustration-calculation; interesting testable implications obviously arise more generally, e.g., in
modified versions of the hammering-one’s-thumb game.

25 See Baumeister et al. (1996) for an interesting discussion linking (threatened) self-esteem and violence. 

26 In previous work we modeled another emotion: guilt (e.g., Battigalli and Dufwenberg, 2007; Chang et al., 2011). To gain perspective note how our 
approach to valence and action-tendency was then reversed. Guilt has valence (negative!) as well as action-tendency (e.g. “repair behavior”; see Silfver, 
2007). In modeling guilt we highlighted anticipation of its negative valence while neglecting action-tendency. 
Several recent experiments explore interesting aspects of blame (Bartling and Fischbacher, 2012; Gurdal et al., 2014;
Celen et al., 2017). Our focus on blame is restricted to its relation to frustration, not on other reasons that may lead people
to blame each other.²⁷


Anger management People aware of their inclination to be angry may attempt to manage or contain their anger. Our players 
anticipate how frustrations shape behavior, and they may avoid or seek certain subgames because of that. However, there 
are interesting related phenomena we do not address: Can i somehow adjust \(\theta_i\), say by taking an “anger management
class?” If so, would rational individuals want to raise, or to lower, their \(\theta_i\)? How might that depend on the game forms they
play and completeness or incompleteness of information? These are potentially relevant questions related to how we have 
modeled action-tendency. Further issues would arise if we were to consider aspects involving anticipated negative valence 
of future frustrations, or bursts of anger. 


Experiments Our models tell stories of what happens when anger-prone players interact. It is natural to wonder about
empirical relevance. Experiments may shed light. 

A few studies that measure beliefs, emotions, and behavior together provide support for the notion that anger and costly 
punishment result from outcomes which do not meet expectations. Pillutla and Murnighan (1996) find that reported
anger predicts rejections better than perceived unfairness does in ultimatum games. Fehr and Gächter (2002) elicit self-reports
of the level of anger towards free riders in a public goods game, concluding that negative emotions including anger are the 
proximate cause of costly punishment. 

Other studies connect unmet expectations and costly punishment in ultimatum games. Falk et al. (2003) measure
beliefs and behavior in ultimatum minigames; they find higher proportions of rejections of disadvantageous offers when responders’
expected payoffs are higher, consistent with our models.²⁸ Schotter and Sopher (2007) measure second-mover expectations,
concluding that unfulfilled expectations drive rejections of low offers. Similarly, Sanfey (2009) finds that psychology students 
who are told that a typical offer in the ultimatum game is $4-$5 reject low offers more frequently than students who are 
told that a typical offer is $1-$2. 

A series of papers by Frans van Winden and coauthors records emotions and expectations in power-to-take games (which
resemble ultimatum games, but allow for partial rejections).²⁹ Second-mover expectations about first-mover “take rates” are
key in the decision to destroy income, and anger-like emotions are triggered by the difference between expected and actual 
take rates. The difference between actual and reported “fair” take rate is not significant in determining anger, suggesting 
that deviations from expectations, rather than from fairness benchmarks, drive anger and the destruction of endowments. 

A literature in neuroscience connects expectations with social norms to study the neural underpinnings of emotional 
behavior. In Xiang et al. (2013), subjects respond to a sequence of ultimatum game offers whilst undergoing brain imaging. 
Unbeknownst to subjects, the experimenter controls the distribution of offers in order to manipulate beliefs. Rejections 
occur more often when subjects expect higher offers. The authors connect norm violations (i.e., lower than expected offers) 
with reward prediction errors from reinforcement learning, which are known to be the computations instantiated by the 
dopaminergic reward system. Xiang et al. note that “when the expectation (norm) is violated, these error signals serve as 
control signals to guide choices. They may also serve as the progenitor of subjective feelings.” 

It would be useful to develop tests specifically designed to target key features of our theory. For example, which version— 
SA, ABB, ABI—seems more empirically relevant, and how does the answer depend on context (e.g., is SA more relevant for 
tired subjects)? Some insights may again be gleaned from existing studies. For example, Gurdal et al. (2014) study games 
where an agent invests on behalf of a principal, choosing between a safe outside option and a risky alternative. If the latter 
is chosen, then it turns out that many principals punish the agent if and only if by chance a poor outcome is realized. This 
seems to indicate some relevance of our ABB solution (relative to ABI). That said, Gurdal et al.’s intriguing design is not 
tailored to specifically test our theory (and beliefs and frustrations are not measured). 

A few recent studies are directly motivated by our work. Persson (2018) presents a test of simple anger. He explores the 
hammering-one’s-thumb game, and documents that frustrations occur much as predicted, but no punishments occur. His 
results thus favor ABB or ABI over SA. More recently, we have ourselves begun to study our models in the laboratory: Aina et 
al. (2018) devise tests that manipulate the responder’s payoff from the proposer’s outside option in ultimatum minigames. 
Dufwenberg et al. (2018a, 2018b) test predictions that link anger to communicated promises and threats. 


Extensions & applications We have (mostly) limited attention to psychological LF-games with complete information. An 
important task for future work is to explore whether and how frustration & anger may matter in other games, e.g., where 
many players move simultaneously in the first stage, with multiple stages, or with incomplete information. In addition, one 
may want to explore other solution concepts than SE. 


27 For example, Celen et al. (2017) present a model where i asks how he would have behaved had he been in j’s position and had j’s beliefs. Then i 
blames j if j appears to be less generous to i than i would have been, and may blame j even if i is not surprised/frustrated. Or imagine a model where 
players blame those considered unkind, as defined in reciprocity theory, independently of frustration. 

28 See the data in Fig. 2 and Table 1 of their paper. Brandts and Sola (2001) find similar behavioral results (see Table 1). Compare also with our discussion 
of Example 3 in Section 2. 

29 Bosman and van Winden (2002), Bosman et al. (2005), Reuben and van Winden (2008). 
Formulating, motivating, and elucidating the key definitions of our models has already been more than a mouthful. We
have therefore also not taken this paper far in the direction of doing applied economics. Make no mistake about it though, 
the hope that our models will prove useful for such work has been a primary driving force. Our psychologically grounded 
models of frustration, anger, and blame may shed light on many of the themes (e.g. pricing, violence, politics, recessions, 
haggling, terror, and traffic) that we listed at the start of this paper. We hope to do some work in these directions ourselves. 


Appendix A 
Below we define belief spaces and provide proofs of the propositions. 
A.1. Preliminaries 


To ease exposition, some key definitions/equations from the main text are repeated below. 

For each topological space X, we let \(\Delta(X)\) denote the space of Borel probability measures on X endowed with the
topology of weak convergence of measures. Every Cartesian product of topological spaces is endowed with the product 
topology. A topological space X is metrizable if there is a metric that induces its topology. A Cartesian product of a countable 
(finite, or denumerable) collection of metrizable spaces is metrizable. 

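The space of first-order beliefs of player i is \(\Delta_i^1 \subseteq \times_{h_i \in H_i} \Delta(Z(h_i))\), consisting of all conditional probability systems
\(\alpha_i = (\alpha_i(\cdot \mid Z(h_i)))_{h_i \in H_i}\) such that:


• the chain rule holds: for all \(h_i, h_i' \in H_i\), if \(h_i \prec h_i'\) then for every \(Y \subseteq Z(h_i')\)

\[
\alpha_i(Z(h_i') \mid Z(h_i)) > 0 \;\Rightarrow\; \alpha_i(Y \mid Z(h_i')) = \frac{\alpha_i(Y \mid Z(h_i))}{\alpha_i(Z(h_i') \mid Z(h_i))}; \tag{7}
\]

• i’s beliefs satisfy an own-action independence property: for all \(h \in H\), \(a_i \in A_i(h)\), \(a_{-i} \in A_{-i}(h)\) (using obvious
abbreviations)

\[
\alpha_{i,-i}(a_{-i} \mid h) = \alpha_{i,-i}(a_{-i} \mid h, a_i). \tag{8}
\]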
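
For finite games, the chain rule can be checked mechanically. The Python sketch below is only an illustration: the data layout (a dictionary from histories to distributions over terminal histories), the function name, and the toy numbers are all assumptions. It tests the chain rule on singletons, which suffices on a finite set by additivity, and it does not test own-action independence.

```python
from math import isclose


def satisfies_chain_rule(cps, precedence_pairs):
    """Check the chain rule on a finite conditional probability system.

    cps maps each history h to a dict {terminal history z: alpha(z | Z(h))};
    the keys of cps[h] are taken to be Z(h). precedence_pairs lists pairs
    (h, h_next) with h preceding h_next, so Z(h_next) is a subset of Z(h).
    """
    for h, h_next in precedence_pairs:
        reach = sum(cps[h][z] for z in cps[h_next])  # alpha(Z(h_next) | Z(h))
        if reach > 0:
            for z, prob in cps[h_next].items():
                if not isclose(prob, cps[h][z] / reach, abs_tol=1e-12):
                    return False
    return True


# Toy tree (hypothetical): Out ends the game, In is followed by L or R.
pairs = [("root", "In")]
cps_ok = {"root": {"Out": 0.5, "In,L": 0.1, "In,R": 0.4},
          "In": {"In,L": 0.2, "In,R": 0.8}}
cps_bad = {"root": {"Out": 0.5, "In,L": 0.1, "In,R": 0.4},
           "In": {"In,L": 0.5, "In,R": 0.5}}
# If In has prior probability zero, the belief after In is unrestricted.
cps_zero = {"root": {"Out": 1.0, "In,L": 0.0, "In,R": 0.0},
            "In": {"In,L": 0.5, "In,R": 0.5}}

ok = satisfies_chain_rule(cps_ok, pairs)
bad = satisfies_chain_rule(cps_bad, pairs)
zero = satisfies_chain_rule(cps_zero, pairs)
print(ok, bad, zero)
```

The third case shows why a conditional probability system carries more information than a single prior: after a probability-zero history, conditional beliefs are not pinned down by the chain rule.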





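The space of second-order beliefs of i is \(\Delta_i^2 \subseteq \times_{h_i \in H_i} \Delta(Z(h_i) \times \Delta_{-i}^1)\), where \(\Delta_{-i}^1 = \times_{j \neq i} \Delta_j^1\), consisting of all
conditional probability systems \(\beta_i = (\beta_i(\cdot \mid h_i))_{h_i \in H_i}\) such that:


• the chain rule holds: if \(h_i \prec h_i'\) then

\[
\beta_i(h_i' \mid h_i) > 0 \;\Rightarrow\; \beta_i(E \mid h_i') = \frac{\beta_i(E \mid h_i)}{\beta_i(h_i' \mid h_i)} \tag{9}
\]

for all \(h_i, h_i' \in H_i\) and every event \(E \subseteq Z(h_i') \times \Delta_{-i}^1\);

• i’s second-order beliefs satisfy an own-action independence property:

\[
\beta_i\bigl(Z(h, (a_i, a_{-i})) \times E_\Delta \mid (h, a_i)\bigr) = \beta_i\bigl(Z(h, (a_i', a_{-i})) \times E_\Delta \mid (h, a_i')\bigr), \tag{10}
\]

for all \(h \in H\), \(a_i, a_i' \in A_i(h)\), \(a_{-i} \in A_{-i}(h)\), and (measurable) \(E_\Delta \subseteq \Delta_{-i}^1\).


Note that (8) and (10) are given by equalities between marginal measures (on \(A_{-i}(h)\) and \(A_{-i}(h) \times \Delta_{-i}^1\) respectively).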


Lemma 1. For each player \(i \in I\), \(\Delta_i^2\) is a compact metrizable space.


Proof. Let \(\Theta\) be a non-empty, compact metrizable space. Lemma 1 in Battigalli and Siniscalchi (1999) (B&S) establishes that
the set of arrays of probability measures \((\mu(\cdot \mid h_i))_{h_i \in H_i} \in \times_{h_i \in H_i} \Delta(Z(h_i) \times \Theta)\) such that

\[
h_i \prec h_i' \;\wedge\; \mu(h_i' \mid h_i) > 0 \;\Rightarrow\; \mu(E \mid h_i') = \frac{\mu(E \mid h_i)}{\mu(h_i' \mid h_i)}
\]

is closed. Note that, in the special case where \(\Theta\) is a singleton, each \(\Delta(Z(h_i) \times \Theta)\) is isomorphic to \(\Delta(Z(h_i))\); hence, the
set of first-order beliefs satisfying (7) is closed. Letting \(\Theta = \Delta_{-i}^1\), we obtain that the set of second-order beliefs satisfying
the chain rule (9) is closed. Since \(\times_{h_i \in H_i} \Delta(Z(h_i))\) is a compact subset of a Euclidean space and eq. (8) is a closed condition
(equalities between marginal measures are preserved in the limit), Lemma 1 in B&S implies that \(\Delta_i^1\) is a closed subset of a
compact metrizable space. Hence, it is a compact metrizable space.

It is well known that if \(X_1, \ldots, X_K\) are compact metrizable, so is \(\times_{k=1}^{K} \Delta(X_k)\) (see Aliprantis and Border, 2006,
Theorem 15.11). Hence, by Lemma 1 in B&S, the set of second-order beliefs satisfying (9) is a closed subset of a compact
metrizable space. Since eq. (10) is a closed condition, this implies that \(\Delta_i^2\) is compact metrizable.
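Lemma 2. For each profile of behavioral strategies \(\sigma = (\sigma_i)_{i \in I}\) there is a unique profile of second-order beliefs \(\beta^\sigma = (\beta_i^\sigma)_{i \in I}\) such that
\((\sigma, \beta^\sigma)\) is a consistent assessment. The map \(\sigma \mapsto \beta^\sigma\) is continuous.


Proof. Write \(P^\sigma(h' \mid h)\) for the probability of reaching \(h'\) from \(h\), e.g.,

\[
P^\sigma(a^1, a^2 \mid \varnothing) = \Bigl(\prod_{j \in I} \sigma_j(a_j^1 \mid \varnothing)\Bigr) \Bigl(\prod_{j \in I} \sigma_j(a_j^2 \mid a^1)\Bigr).
\]

Define \(\alpha_i^\sigma(z \mid h) = P^\sigma(z \mid h)\) for all \(i \in I\), \(h \in H\), \(z \in Z\). Define \(\beta_i^\sigma\) as \(\beta_i^\sigma(\cdot \mid h) = \alpha_i^\sigma(\cdot \mid h) \times \delta_{\alpha_{-i}^\sigma}\) for all \(i \in I\), \(h \in H\). It can be
checked that (1) \(\beta_i^\sigma \in \Delta_i^2\) for each \(i \in I\), (2) \((\sigma, \beta^\sigma)\) is a consistent assessment, and (3) if \(\beta \neq \beta^\sigma\), then either (a) or (b) of
Definition 2 is violated. It is also apparent from the construction that the map \(\sigma \mapsto \beta^\sigma\) is continuous, because \(\sigma \mapsto \alpha^\sigma\) is
obviously continuous, and the Dirac-measure map \(\alpha_{-i} \mapsto \delta_{\alpha_{-i}}\) is continuous.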
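
The path probability used in this proof is a product of behavioral-strategy probabilities along the path. A minimal Python sketch follows, assuming a hypothetical data layout in which `sigma[j][history][action]` stores the probability that player j takes `action` at `history`, and in which an inactive player has the single action `"wait"`; none of these names come from the paper.

```python
from math import isclose, prod


def path_probability(sigma, path):
    """Probability of a path of action profiles, starting from the root.

    sigma[j][history][action] is player j's behavioral-strategy probability;
    a history is the tuple of action profiles played so far.
    """
    history = ()
    probability = 1.0
    for profile in path:
        probability *= prod(sigma[j][history][a_j] for j, a_j in enumerate(profile))
        history = history + (profile,)
    return probability


# Two-stage leader-follower example (hypothetical numbers): player 0 leads.
after_in = (("In", "wait"),)
sigma = [
    {(): {"In": 0.25, "Out": 0.75}, after_in: {"wait": 1.0}},
    {(): {"wait": 1.0}, after_in: {"L": 0.4, "R": 0.6}},
]
p_in_then_r = path_probability(sigma, [("In", "wait"), ("wait", "R")])
print(p_in_then_r)
```

Because the result is a polynomial in the entries of the strategy profile, its continuity in the profile is immediate, which is the fact the proof relies on.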














Lemma 3. The set of consistent assessments is compact. 


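Proof. Lemma 1 implies that \(\times_{i \in I}(\Sigma_i \times \Delta_i^2)\) is a compact metrizable space that contains the set of consistent assessments.
Therefore, it is enough to show that the latter is closed. Let \((\sigma^n, \beta^n)_{n \in \mathbb{N}}\) be a converging sequence of consistent assessments
with limit \((\sigma^\infty, \beta^\infty)\). For each \(i \in I\), let \(\alpha_i^n\) be the first-order belief derived from \(\beta_i^n\) (\(n \in \mathbb{N} \cup \{\infty\}\)), that is,

\[
\alpha_i^n(Y \mid h) = \beta_i^n(Y \times \Delta_{-i}^1 \mid h)
\]

for all \(h \in H\), \(Y \subseteq Z(h)\). By consistency, for all \(n \in \mathbb{N}\), \(i \in I\), \(h \in H\), \(a \in A(h)\), we have:

• (a.n) \(\alpha_i^n(a \mid h) = \beta_i^n(Z(h, a) \times \Delta_{-i}^1 \mid h) = \prod_{j \in I} \sigma_j^n(a_j \mid h)\),
• (b.n) \(\mathrm{marg}_{\Delta_{-i}^1} \beta_i^n(\cdot \mid h) = \delta_{\alpha_{-i}^n}\), where each \(\alpha_j^n\) is determined as in (a.n).

Then, for all \(i \in I\), \(h \in H\), \(a \in A(h)\),

\[
\alpha_i^\infty(a \mid h) = \beta_i^\infty(Z(h, a) \times \Delta_{-i}^1 \mid h) = \prod_{j \in I} \sigma_j^\infty(a_j \mid h).
\]

Furthermore, \(\mathrm{marg}_{\Delta_{-i}^1} \beta_i^\infty(\cdot \mid h) = \delta_{\alpha_{-i}^\infty}\) for all \(i \in I\) and \(h \in H\), because \(\alpha_{-i}^n \to \alpha_{-i}^\infty\) and the marginalization and Dirac maps
\(\beta_i \mapsto \mathrm{marg}_{\Delta_{-i}^1} \beta_i\) and \(\alpha_{-i} \mapsto \delta_{\alpha_{-i}}\) are continuous.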


A.2. Proof of Proposition 1 


Fix \(i \in I\) arbitrarily. First-order belief \(\alpha_i\) is derived from \(\beta_i\) and, by consistency, gives the behavioral strategy profile
\(\sigma\). Therefore, by assumption each \(h' \preceq h\) has probability one under \(\alpha_i\), which implies that \(E[\pi_i \mid h'; \alpha_i] = E[\pi_i; \alpha_i]\), hence
\(F_i(h'; \alpha_i) = 0\). Since blame is capped by frustration, \(u_i(h', a_i; \beta_i) = E[\pi_i \mid (h', a_i); \alpha_i]\). Therefore, sequential rationality of the
equilibrium assessment implies that \(\mathrm{Supp}\,\sigma_i(\cdot \mid h') \subseteq \arg\max_{a_i' \in A_i(h')} E[\pi_i \mid (h', a_i'); \alpha_i]\). If there is randomization only in the last
stage (or none at all), then players maximize locally their expected material payoff on the equilibrium path. Hence, the
second claim follows by inspection of the definitions of agent form of the material-payoff game and Nash equilibrium.














A.3. Proof of Proposition 2 


Let \((\bar\sigma, \bar\beta) = (\bar\sigma_i, \bar\beta_i)_{i \in I}\) be the SE of the material-payoff game, which must be in pure strategies (Remark 3). Fix decision
utility functions \(u_i(h, a_i; \cdot)\) of the ABI, or ABB kind, and a sequence of real numbers \((\varepsilon_n)_{n \in \mathbb{N}}\), with \(\varepsilon_n \to 0\) and
\(0 < \varepsilon_n < 1 / \max_{i \in I, h \in H} |A_i(h)|\) for all \(n \in \mathbb{N}\). Consider the constrained psychological game where players can choose mixed actions in

\[
\Sigma_i^n(h) = \{\sigma_i(\cdot \mid h) \in \Delta(A_i(h)) : \|\sigma_i(\cdot \mid h) - \bar\sigma_i(\cdot \mid h)\| \le \varepsilon_n\}
\]

if h is on the \(\bar\sigma\)-path, and in

\[
\Sigma_i^n(h) = \{\sigma_i(\cdot \mid h) \in \Delta(A_i(h)) : \forall a_i \in A_i(h),\ \sigma_i(a_i \mid h) \ge \varepsilon_n\}
\]

if h is off the \(\bar\sigma\)-path. By construction, these sets are non-empty, convex, and compact. Since the decision utility functions
are continuous, and the consistent assessment map \(\sigma \mapsto \beta^\sigma\) is continuous (Lemma 2), the correspondence

\[
\sigma \mapsto \times_{h \in H} \times_{i \in I} \arg\max_{\sigma_i'(\cdot \mid h) \in \Sigma_i^n(h)} \sum_{a_i \in A_i(h)} \sigma_i'(a_i \mid h)\, u_i(h, a_i; \beta_i^\sigma)
\]

is upper-hemicontinuous, non-empty, convex, and compact valued; therefore (by Kakutani’s theorem), it has a fixed point
\(\sigma^n\). By Lemma 3, the sequence of consistent assessments \((\sigma^n, \beta^{\sigma^n})_{n=1}^{\infty}\) has a limit point \((\sigma^*, \beta^*)\), which is consistent too.
By construction, \(\bar\sigma(\cdot \mid h) = \sigma^*(\cdot \mid h)\) for h on the \(\bar\sigma\)-path, therefore \((\bar\sigma, \bar\beta)\) and \((\sigma^*, \beta^*)\) are realization-equivalent. We let \(\bar\alpha_i\)
(respectively, \(\alpha_i^*\)) denote the first-order beliefs of i implied by \((\bar\sigma, \bar\beta)\) (respectively, \((\sigma^*, \beta^*)\)).

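We claim that the consistent assessment \((\sigma^*, \beta^*)\) is an SE of the psychological game with decision utility functions
\(u_i(h, a_i; \cdot)\). We must show that \((\sigma^*, \beta^*)\) satisfies sequential rationality. If h is off the \(\bar\sigma\)-path, sequential rationality is satisfied
by construction. Since \(\bar\sigma\) is deterministic and there are no chance moves, if h is on the \(\bar\sigma\)-path (i.e. on the \(\sigma^*\)-path)
it must have unconditional probability 1 according to each player’s beliefs and there cannot be any frustration; hence,
\(u_i(h, a_i; \beta_i^*) = E[\pi_i \mid h, a_i; \alpha_i^*]\) (\(i \in I\)) where \(\alpha_i^*\) is determined by \(\sigma^*\). If, furthermore, it is the second stage (\(h = \bar a^1\), with
\(\bar\sigma(\bar a^1 \mid \varnothing) = 1\)), then, by construction, \(E[\pi_i \mid h, a_i; \alpha_i^*] = E[\pi_i \mid h, a_i; \bar\alpha_i]\), where \(\bar\alpha_i\) is determined by \(\bar\sigma\). Since \(\bar\sigma\) is an SE of the
material-payoff game, sequential rationality is satisfied at h. Finally, we claim that \((\sigma^*, \beta^*)\) satisfies sequential rationality
also at the root \(h = \varnothing\). Let \(\iota(h)\) denote the active player at h. Since \(\iota(\varnothing)\) cannot be frustrated at \(\varnothing\), we must show that
action \(\bar a^1\) with \(\bar\sigma(\bar a^1 \mid \varnothing) = 1\) maximizes his expected material payoff given belief \(\alpha^*_{\iota(\varnothing)}\). According to ABB and ABI, player
\(\iota(a^1)\) can only blame the first mover \(\iota(\varnothing)\) and possibly hurt him, if he is frustrated. Therefore, in assessment \((\sigma^*, \beta^*)\) at
node \(a^1\), either \(\iota(a^1)\) plans to choose his (unique) payoff maximizing action, or he blames \(\iota(\varnothing)\) strongly enough to give up
some material payoff in order to bring down the payoff of \(\iota(\varnothing)\). Hence,
\(E[\pi_{\iota(\varnothing)} \mid a^1; \alpha^*_{\iota(a^1)}] \le E[\pi_{\iota(\varnothing)} \mid a^1; \bar\alpha_{\iota(a^1)}]\) (anger). By
consistency of \((\sigma^*, \beta^*)\) and \((\bar\sigma, \bar\beta)\), \(\alpha^*_{\iota(a^1)} = \alpha^*_{\iota(\varnothing)}\) and \(\bar\alpha_{\iota(a^1)} = \bar\alpha_{\iota(\varnothing)}\) (cons.). Since \((\sigma^*, \beta^*)\) is realization-equivalent (r.e.) to
\((\bar\sigma, \bar\beta)\), which is the material-payoff equilibrium (m.eq.), for each \(a^1 \in A(\varnothing)\),

\[
\begin{aligned}
E[\pi_{\iota(\varnothing)} \mid \bar a^1; \alpha^*_{\iota(\varnothing)}]
&\overset{\text{(r.e.)}}{=} E[\pi_{\iota(\varnothing)} \mid \bar a^1; \bar\alpha_{\iota(\varnothing)}]
\overset{\text{(m.eq.)}}{\ge} E[\pi_{\iota(\varnothing)} \mid a^1; \bar\alpha_{\iota(\varnothing)}] \\
&\overset{\text{(cons.)}}{=} E[\pi_{\iota(\varnothing)} \mid a^1; \bar\alpha_{\iota(a^1)}]
\overset{\text{(anger)}}{\ge} E[\pi_{\iota(\varnothing)} \mid a^1; \alpha^*_{\iota(a^1)}] \\
&\overset{\text{(cons.)}}{=} E[\pi_{\iota(\varnothing)} \mid a^1; \alpha^*_{\iota(\varnothing)}].
\end{aligned}
\]

This completes the proof for ABB & ABI. If there are only two players, then we have a LF-game and SA is equivalent to ABB,
so \((\sigma^*, \beta^*)\) is an SE in this case too.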














A.4. Proof of Proposition 3 


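Recall that \(\ell\) and \(f\) respectively denote the leader and the follower and that, by convention, the leader has no terminating
action. By Remark 3, we obtain the unique and pure material-payoff SE strategy pair, viz. \((\bar\sigma_\ell, \bar\sigma_f)\), by backward induction:
for each \(a_\ell \in A_\ell(\varnothing)\), let

\[
\bar s_f(a_\ell) = \arg\max_{a_f \in A_f(a_\ell)} \pi_f(a_\ell, a_f)
\]

denote the material best reply of f to \(a_\ell\) (unique by assumption); then

\[
\forall a_\ell \in A_\ell(\varnothing),\ \bar\sigma_f(\bar s_f(a_\ell) \mid a_\ell) = 1,
\qquad
\bar\sigma_\ell\Bigl(\arg\max_{a_\ell \in A_\ell(\varnothing)} \pi_\ell(a_\ell, \bar s_f(a_\ell)) \Bigm| \varnothing\Bigr) = 1. \tag{11}
\]

Let \((\bar a_\ell, \bar s_f)\) denote this pure-strategy equilibrium. We must prove that \(E[\pi_f; \beta] \ge \pi_f(\bar a_\ell, \bar s_f(\bar a_\ell))\) for every SE assessment
\((\sigma, \beta)\). Now, fix arbitrarily an SE \((\sigma, \beta)\); thus, by consistency, \(\sigma\) is derived from the first-order beliefs implied by \(\beta\). Observe
that

\[
\forall a_\ell \in A_\ell(\varnothing) \setminus \{\bar a_\ell\},\quad
E[\pi_\ell \mid a_\ell; \beta] \overset{\text{(anger)}}{\le} \pi_\ell(a_\ell, \bar s_f(a_\ell)) < \pi_\ell(\bar a_\ell, \bar s_f(\bar a_\ell)). \tag{12}
\]

The first inequality holds because either f, upon observing \(a_\ell\), is angry enough to deviate from his material-payoff
maximizing action \(\bar s_f(a_\ell)\) and punish \(\ell\), or he replies with \(\bar s_f(a_\ell)\); the second inequality holds because \((\bar a_\ell, \bar s_f)\) is the unique
material-payoff SE. Since in every SE the leader, who cannot be frustrated, maximizes his expected material payoff, it must
be the case that either (i) \(\sigma_\ell(\bar a_\ell \mid \varnothing) = 1\), or (ii) \(E[\pi_\ell \mid \bar a_\ell; \beta] < \pi_\ell(\bar a_\ell, \bar s_f(\bar a_\ell))\), otherwise (12) implies that he would choose
\(\bar a_\ell\) with probability 1. In case (i), f cannot be frustrated after \(\bar a_\ell\); hence, \(\sigma_f(\bar s_f(\bar a_\ell) \mid \bar a_\ell) = 1\), \(\sigma_\ell(\bar a_\ell \mid \varnothing) = 1\), and the SE payoff
of f is \(\pi_f(\bar a_\ell, \bar s_f(\bar a_\ell))\), the same as in the material-payoff SE. In case (ii), \(\sigma_f(\bar s_f(\bar a_\ell) \mid \bar a_\ell) < 1\), that is, f is not choosing his
material-payoff maximizing action \(\bar s_f(\bar a_\ell)\) because he is frustrated. Therefore,

\[
E[\pi_f; \beta] - \pi_f(\bar a_\ell, \bar s_f(\bar a_\ell)) = E[\pi_f; \beta] - \max_{a_f \in A_f(\bar a_\ell)} \pi_f(\bar a_\ell, a_f) = F_f(\bar a_\ell; \alpha_f) > 0,
\]

where \(\alpha_f\) is derived from \(\beta\). Thus, in each case \(E[\pi_f; \beta] \ge \pi_f(\bar a_\ell, \bar s_f(\bar a_\ell))\).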
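
The backward-induction step in (11) can be sketched in a few lines of Python. This is an illustration under assumptions, not part of the proof: payoffs are stored in dictionaries keyed by (leader action, follower action), best replies are assumed unique as in the text, and the ultimatum-minigame numbers and all names are hypothetical.

```python
def material_backward_induction(pi_l, pi_f, replies):
    """Return the material-payoff SE (leader action, follower strategy).

    pi_l and pi_f map (leader action, follower action) to material payoffs;
    replies maps each leader action to the follower actions available after it.
    Best replies are assumed unique, so ties are not handled specially.
    """
    s_bar_f = {a_l: max(acts, key=lambda a_f: pi_f[(a_l, a_f)])
               for a_l, acts in replies.items()}
    a_bar_l = max(replies, key=lambda a_l: pi_l[(a_l, s_bar_f[a_l])])
    return a_bar_l, s_bar_f


# Hypothetical ultimatum minigame: the leader proposes, the follower responds.
replies = {"fair": ["acc", "rej"], "greedy": ["acc", "rej"]}
pi_l = {("fair", "acc"): 5, ("fair", "rej"): 0, ("greedy", "acc"): 9, ("greedy", "rej"): 0}
pi_f = {("fair", "acc"): 5, ("fair", "rej"): 0, ("greedy", "acc"): 1, ("greedy", "rej"): 0}

a_bar_l, s_bar_f = material_backward_induction(pi_l, pi_f, replies)
print(a_bar_l, s_bar_f)
```

In this example the follower accepts either proposal and the leader therefore proposes the greedy split, so the follower’s material-payoff SE payoff is 1, the lower bound on his SE payoff asserted above.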
A.5. Proof of Proposition 4


Recall that we are assuming that there are no relevant ties (NRT) and that \((\bar a_\ell, \bar s_f)\) is the unique and necessarily pure
SE strategy pair of the material-payoff game. Thus, \(\bar s_f\) is the best reply function of the follower, and \(\bar a_\ell = r_\ell(\bar s_f)\), where
\(r_\ell : S_f \to A_\ell(\varnothing)\) is the strategic-form best reply function of the leader. We are going through some steps to construct an
assessment \((s^*, \beta^*) = (a_\ell^*, s_f^*, \beta^*)\) such that \(s_f^*\) is a threat and \((s^*, \beta^*)\) is an SE for every \(\theta_f\) above a threshold. This yields
part 1 of Proposition 4. Parts 2 and 3 follow from Definition 4 of threat, which implies that \(a_\ell^* \neq \bar a_\ell\) and

\[
\pi_f(a_\ell^*, s_f^*(a_\ell^*)) = \pi_f(a_\ell^*, \bar s_f(a_\ell^*)) > \pi_f(\bar a_\ell, \bar s_f(\bar a_\ell)).
\]

Let TE be the subset of pure strategy Nash equilibria of the material payoff game (mNEs, for short) such that the
follower’s strategy is a threat. Since the game has some threat \(\hat s_f\) (by hypothesis) and \((r_\ell(\hat s_f), \hat s_f)\) is a mNE, TE is nonempty
(Remark 5.i). With this, let

\[
TE^{**} := \arg\max_{(a_\ell, s_f) \in TE} \pi_f(a_\ell, s_f(a_\ell)),
\qquad
TE^{*} := \arg\max_{(a_\ell, s_f) \in TE^{**}} \pi_\ell(a_\ell, s_f(a_\ell)).
\]

In words, \(TE^*\) selects the elements of TE that, lexicographically, first maximize the follower’s payoff, and then maximize
the leader’s payoff.³⁰ Since TE is finite, \(TE^{**}\) is finite and nonempty; thus, \(TE^* \neq \varnothing\).
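
The lexicographic selection can be sketched as follows. The Python code is illustrative only: the candidate set `te` is simply taken as given (the sketch does not check that its elements are mNEs with threat strategies), strategies are dictionaries from leader actions to follower replies, and all names and payoff numbers are hypothetical.

```python
def lexicographic_selection(te, pi_l, pi_f):
    """Return (TE**, TE*): maximize the follower's payoff first, then the leader's.

    te is a list of pairs (leader action, follower strategy); payoffs are
    evaluated on the path of each pair.
    """
    best_f = max(pi_f[(a, s[a])] for a, s in te)
    te_2star = [(a, s) for a, s in te if pi_f[(a, s[a])] == best_f]
    best_l = max(pi_l[(a, s[a])] for a, s in te_2star)
    te_star = [(a, s) for a, s in te_2star if pi_l[(a, s[a])] == best_l]
    return te_2star, te_star


# Hypothetical candidates; only on-path payoffs are needed for the selection.
strategy = {"a": "x", "b": "y", "c": "z"}
te = [("a", strategy), ("b", strategy), ("c", strategy)]
pi_f = {("a", "x"): 3, ("b", "y"): 3, ("c", "z"): 2}
pi_l = {("a", "x"): 1, ("b", "y"): 2, ("c", "z"): 5}

te_2star, te_star = lexicographic_selection(te, pi_l, pi_f)
print([a for a, _ in te_2star], [a for a, _ in te_star])
```

Here the pair with leader action "c" gives the leader the most but is dropped in the first step, because the follower’s payoff takes priority.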


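Claim. Set \(TE^*\) has the following property:

\[
\forall (\hat a_\ell, \hat s_f) \in TE^*,\ \forall a_\ell \in A_\ell(\varnothing) \setminus \{\hat a_\ell\}:\quad
\pi_f(a_\ell, \bar s_f(a_\ell)) \ge \pi_f(\hat a_\ell, \hat s_f(\hat a_\ell)) \;\Rightarrow\; \pi_\ell(a_\ell, \bar s_f(a_\ell)) < \pi_\ell(\hat a_\ell, \hat s_f(\hat a_\ell)). \tag{13}
\]


Proof of the claim. We prove the claim by contraposition, that is, we prove that if \((\hat a_\ell, \hat s_f)\) violates (13) then \((\hat a_\ell, \hat s_f) \notin TE^*\).
Since \(TE^* \subseteq TE^{**}\), if \((\hat a_\ell, \hat s_f) \notin TE^{**}\) then \((\hat a_\ell, \hat s_f) \notin TE^*\). Next suppose that \((\hat a_\ell, \hat s_f) \in TE^{**}\) and (13) does not hold, i.e.,
for some \(\tilde a_\ell \in A_\ell(\varnothing) \setminus \{\hat a_\ell\}\),

\[
\pi_f(\tilde a_\ell, \bar s_f(\tilde a_\ell)) \ge \pi_f(\hat a_\ell, \hat s_f(\hat a_\ell)) \quad\text{and}\quad \pi_\ell(\tilde a_\ell, \bar s_f(\tilde a_\ell)) > \pi_\ell(\hat a_\ell, \hat s_f(\hat a_\ell)). \tag{14}
\]

Let \(\tilde s_f(\tilde a_\ell) = \bar s_f(\tilde a_\ell)\) and \(\tilde s_f(a_\ell) = \hat s_f(a_\ell)\) for every \(a_\ell \neq \tilde a_\ell\) (thus, including \(a_\ell = \hat a_\ell\)). We are going to prove that
\((\tilde a_\ell, \tilde s_f) \in TE^{**}\) as well. This implies that \((\hat a_\ell, \hat s_f) \notin TE^*\), because, by \(\tilde s_f(\tilde a_\ell) = \bar s_f(\tilde a_\ell)\) and (14), the leader’s payoff is higher under
\((\tilde a_\ell, \tilde s_f)\) than under \((\hat a_\ell, \hat s_f)\). Indeed, \((\tilde a_\ell, \tilde s_f) \in TE^{**}\) because

\[
\pi_f(\tilde a_\ell, \tilde s_f(\tilde a_\ell)) = \pi_f(\tilde a_\ell, \bar s_f(\tilde a_\ell)) \ge \pi_f(\hat a_\ell, \hat s_f(\hat a_\ell)) = \pi_f(\hat a_\ell, \bar s_f(\hat a_\ell)) > \pi_f(\bar a_\ell, \bar s_f(\bar a_\ell)),
\]

where the first equality and inequality hold by \(\tilde s_f(\tilde a_\ell) = \bar s_f(\tilde a_\ell)\) and (14), and the second equality and inequality hold
because, since \((\hat a_\ell, \hat s_f) \in TE^{**}\), \(\hat s_f\) is a threat. Thus, \((\tilde a_\ell, \tilde s_f)\) satisfies Condition 2 of Definition 4 and, by Remark 5.ii, strategy
\(\tilde s_f\) is a threat. Since \((\tilde a_\ell, \tilde s_f)\) gives to the follower a payoff as high as \((\hat a_\ell, \hat s_f) \in TE^{**}\), it follows that \((\tilde a_\ell, \tilde s_f) \in TE^{**}\). This
concludes the proof of the Claim.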











Fix any \((a_\ell^*, \hat s_f) \in TE^*\). Note that, if the follower expects to get \(\pi_f(a_\ell^*, \hat s_f(a_\ell^*))\), then he is frustrated by an action \(a_\ell\) if
and only if \(\pi_f(a_\ell, \bar s_f(a_\ell)) < \pi_f(a_\ell^*, \hat s_f(a_\ell^*))\), i.e. his best payoff after \(a_\ell\) is less than he expected. Now consider the off-path
modification \(s_f^*\) of \(\hat s_f\) that minimizes the leader’s payoff in the “cheapest” way if he deviates from \(a_\ell^*\) in a way that frustrates
the follower, and selects the material best reply otherwise. Formally, for every \(a_\ell \in A_\ell(\varnothing)\) let

\[
A_f^{\min}(a_\ell) := \arg\min_{a_f \in A_f(a_\ell)} \pi_\ell(a_\ell, a_f)
\]

denote the set of follower’s actions that minimize the leader’s payoff given \(a_\ell\). With this, let

\[
\bigl(\pi_f(a_\ell, \bar s_f(a_\ell)) < \pi_f(a_\ell^*, \hat s_f(a_\ell^*))\bigr) \;\Rightarrow\; s_f^*(a_\ell) = \arg\max_{a_f \in A_f^{\min}(a_\ell)} \pi_f(a_\ell, a_f), \tag{15}
\]

\[
\bigl(\pi_f(a_\ell, \bar s_f(a_\ell)) \ge \pi_f(a_\ell^*, \hat s_f(a_\ell^*))\bigr) \;\Rightarrow\; s_f^*(a_\ell) = \bar s_f(a_\ell), \tag{16}
\]

for every \(a_\ell \in A_\ell(\varnothing)\). Since \((a_\ell^*, \hat s_f)\) is an mNE, the follower chooses the material best reply on path: \(\hat s_f(a_\ell^*) = \bar s_f(a_\ell^*)\).
Therefore, (16) implies \(s_f^*(a_\ell^*) = \hat s_f(a_\ell^*)\).


30 Note that NRT allows for ties between payoffs of the leader following distinct replies to the same action \(a_\ell\). Hence, there may be room for such
maximization.


Claim. Strategy \(s_f^*\) is a threat.


Proof of the claim. Note that

\[
\pi_f(a_\ell^*, s_f^*(a_\ell^*)) = \pi_f(a_\ell^*, \hat s_f(a_\ell^*)) = \pi_f(a_\ell^*, \bar s_f(a_\ell^*)) > \pi_f(\bar a_\ell, \bar s_f(\bar a_\ell))
\]

by construction and because \(\hat s_f\) is a threat. Hence, \(s_f^*\) satisfies Condition 2 of Definition 4 and, by Remark 5.ii, strategy \(s_f^*\)
is a threat as well.














Claim. Action \(a_\ell^*\) is the best reply (unique by NRT) to strategy \(s_f^*\), that is, \(a_\ell^* = r_\ell(s_f^*)\).


Proof of the claim. For every \(a_\ell \in A_\ell(\varnothing)\),

\[
\pi_\ell(a_\ell^*, s_f^*(a_\ell^*)) = \pi_\ell(a_\ell^*, \hat s_f(a_\ell^*)) \ge \pi_\ell(a_\ell, \hat s_f(a_\ell)) \ge \pi_\ell(a_\ell, s_f^*(a_\ell)),
\]

where the first equality holds by construction, the first inequality holds because \((a_\ell^*, \hat s_f)\) is an mNE, and the second
inequality holds by construction. To see the latter note that if the follower is frustrated, then the leader is minimized by \(s_f^*\), and
if he is not frustrated, then \(s_f^*(a_\ell) = \bar s_f(a_\ell)\) and the inequality is implied by (13). Thus, \(a_\ell^*\) is the (necessarily unique) best
reply to \(s_f^*\).














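Let $\alpha_f^*$ denote the first-order belief of the follower that corresponds to $(a_\ell^*, s_f^*)$, so that $\alpha_f^*(a_\ell^* \mid \varnothing) = 1$ and $\alpha_f^*(s_f^*(a_\ell) \mid a_\ell) = 1$ for each $a_\ell \in A_\ell(\varnothing)$.

Claim. There is $\bar{\theta}_f \geq 0$ such that, for every $\theta_f \geq \bar{\theta}_f$, strategy $s_f^*$ is a sequential best reply under SA, that is,

$$\forall a_\ell \in A_\ell(\varnothing), \quad s_f^*(a_\ell) \in \arg\max_{a_f \in A_f(a_\ell)} \left[ \pi_f(a_\ell, a_f) - \theta_f F_f(a_\ell; \alpha_f^*)\, \pi_\ell(a_\ell, a_f) \right]. \tag{17}$$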


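Proof of the claim. Eq. (17) holds by construction for every $\theta_f \geq 0$ at each $a_\ell$ such that $\pi_f(a_\ell, \bar{s}_f(a_\ell)) \geq \pi_f(a_\ell^*, s_f^*(a_\ell^*))$, because $F_f(a_\ell; \alpha_f^*) = 0$ and $s_f^*(a_\ell) = \bar{s}_f(a_\ell)$ is the material best reply to $a_\ell$. Next consider any $a_\ell$ such that $\pi_f(a_\ell, \bar{s}_f(a_\ell)) < \pi_f(a_\ell^*, s_f^*(a_\ell^*))$. Then,

$$F_f(a_\ell; \alpha_f^*) = \pi_f(a_\ell^*, s_f^*(a_\ell^*)) - \max_{a_f \in A_f(a_\ell)} \pi_f(a_\ell, a_f) = \pi_f(a_\ell^*, s_f^*(a_\ell^*)) - \pi_f(a_\ell, \bar{s}_f(a_\ell)) > 0.$$

For every such action $a_\ell$, we want to show that there is a large enough $\theta_f \geq 0$ so that, for every $a_f \in A_f(a_\ell) \setminus \{s_f^*(a_\ell)\}$,

$$\pi_f(a_\ell, s_f^*(a_\ell)) - \theta_f F_f(a_\ell; \alpha_f^*)\, \pi_\ell(a_\ell, s_f^*(a_\ell)) \geq \pi_f(a_\ell, a_f) - \theta_f F_f(a_\ell; \alpha_f^*)\, \pi_\ell(a_\ell, a_f),$$

that is,

$$\theta_f F_f(a_\ell; \alpha_f^*) \left[ \pi_\ell(a_\ell, a_f) - \pi_\ell(a_\ell, s_f^*(a_\ell)) \right] \geq \pi_f(a_\ell, a_f) - \pi_f(a_\ell, s_f^*(a_\ell)). \tag{18}$$

Since $s_f^*(a_\ell) \in A_f^{\min}(a_\ell)$, we have $\pi_\ell(a_\ell, a_f) - \pi_\ell(a_\ell, s_f^*(a_\ell)) \geq 0$. Therefore

$$\theta_f F_f(a_\ell; \alpha_f^*) \left[ \pi_\ell(a_\ell, a_f) - \pi_\ell(a_\ell, s_f^*(a_\ell)) \right] \geq 0.$$

If $\pi_\ell(a_\ell, a_f) - \pi_\ell(a_\ell, s_f^*(a_\ell)) = 0$, then also $a_f \in A_f^{\min}(a_\ell)$ is a minimizer like $s_f^*(a_\ell)$. Since $s_f^*(a_\ell) \in \arg\max_{a_f' \in A_f^{\min}(a_\ell)} \pi_f(a_\ell, a_f')$, we have $0 \geq \pi_f(a_\ell, a_f) - \pi_f(a_\ell, s_f^*(a_\ell))$ and (18) must hold. If $\pi_\ell(a_\ell, a_f) - \pi_\ell(a_\ell, s_f^*(a_\ell)) > 0$, since $F_f(a_\ell; \alpha_f^*) > 0$ as well, we obtain the pairwise threshold

$$\bar{\theta}_f(a_\ell, a_f) = \max\left\{ 0,\ \frac{\pi_f(a_\ell, a_f) - \pi_f(a_\ell, s_f^*(a_\ell))}{F_f(a_\ell; \alpha_f^*) \left[ \pi_\ell(a_\ell, a_f) - \pi_\ell(a_\ell, s_f^*(a_\ell)) \right]} \right\}.$$

With this, (17) holds if

$$\theta_f \geq \bar{\theta}_f := \max\left\{ \bar{\theta}_f(a_\ell, a_f) : \pi_f(a_\ell, \bar{s}_f(a_\ell)) < \pi_f(a_\ell^*, s_f^*(a_\ell^*)),\ a_f \in A_f(a_\ell) \setminus A_f^{\min}(a_\ell) \right\} \geq 0.$$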


Let $s^* = (a_\ell^*, s_f^*)$ and let $\beta^* = \beta^{s^*}$ be the unique second-order belief system that makes assessment $(s^*, \beta^*)$ consistent (see Lemma 2). If $\theta_f \geq \bar{\theta}_f$, the last two claims imply that $(s^*, \beta^*)$ is an SE under SA, which assumes $F_f = B_f$. By Remark 4, in LF-games frustration and blame coincide, hence $(s^*, \beta^*)$ is an SE under ABB as well. To sum up, we proved that, for every $\theta_f \geq \bar{\theta}_f$, $(s^*, \beta^*)$ is an SE under ABB/SA where $s_f^*$ is a threat.
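As an illustration only, the construction in (15)-(16) and the resulting threshold can be computed mechanically for a finite LF-game. The Python sketch below does so for a hypothetical two-action game: the payoff numbers, the dictionary encoding `GAME`, and all function names are assumptions made for this example and do not come from the paper. It presumes integer payoffs, no ties where NRT rules them out, and that the follower plays his material best reply on path, as in an mNE.

```python
from fractions import Fraction

# Hypothetical LF-game (not from the paper): GAME[a_l][a_f] = (pi_l, pi_f).
GAME = {
    "fair":   {"accept": (2, 2), "reject": (0, 0)},
    "greedy": {"accept": (3, 1), "reject": (0, 0)},
}


def best_material_payoff(replies):
    """Follower's highest material payoff among the available replies."""
    return max(pi_f for _, pi_f in replies.values())


def modified_threat(game, a_star):
    """Build s*_f following (15)-(16), given the on-path leader action a_star."""
    expected = best_material_payoff(game[a_star])  # follower best-replies on path
    s_star = {}
    for a_l, replies in game.items():
        if best_material_payoff(replies) < expected:
            # (15): frustrated, so restrict to the leader-payoff minimizers.
            min_pi_l = min(pi_l for pi_l, _ in replies.values())
            candidates = [a for a, (pi_l, _) in replies.items() if pi_l == min_pi_l]
        else:
            # (16): not frustrated, so every reply is a candidate.
            candidates = list(replies)
        # Among the candidates, take the reply that is best for the follower.
        s_star[a_l] = max(candidates, key=lambda a: replies[a][1])
    return s_star, expected


def anger_threshold(game, a_star):
    """Largest pairwise threshold over frustrating actions and non-minimizing replies."""
    s_star, expected = modified_threat(game, a_star)
    theta_bar = Fraction(0)
    for a_l, replies in game.items():
        frustration = expected - best_material_payoff(replies)
        if frustration <= 0:
            continue
        pi_l_star, pi_f_star = replies[s_star[a_l]]
        for pi_l, pi_f in replies.values():
            if pi_l > pi_l_star:  # this reply does not minimize the leader's payoff
                ratio = Fraction(pi_f - pi_f_star, frustration * (pi_l - pi_l_star))
                theta_bar = max(theta_bar, ratio)
    return theta_bar


def is_sequential_best_reply(game, a_star, theta):
    """Check (17): s*_f maximizes pi_f - theta * F_f * pi_l after every a_l."""
    s_star, expected = modified_threat(game, a_star)
    for a_l, replies in game.items():
        frustration = max(0, expected - best_material_payoff(replies))

        def utility(a_f):
            pi_l, pi_f = replies[a_f]
            return pi_f - theta * frustration * pi_l

        if any(utility(a_f) > utility(s_star[a_l]) for a_f in replies):
            return False
    return True


s_star, _ = modified_threat(GAME, "fair")
theta_bar = anger_threshold(GAME, "fair")
print(s_star, theta_bar)
```

In this hypothetical game the follower expects a payoff of 2 on path, is frustrated by `greedy` (frustration 1), and punishes by rejecting. The computed threshold is 1/3, the value at which he is exactly indifferent between accepting and rejecting after the deviation.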














A.6. Proof of Proposition 5 


By Proposition 4, in the given LF-game with a threat and with no relevant ties, there is a pure strategy deterrence SE with a threat $(\hat{a}_\ell, \hat{s}_f)$ for a sufficiently high $\theta_f$. Thus, the set of parameters that support $(\hat{a}_\ell, \hat{s}_f)$ as an SE of the game with anger is nonempty. The threshold we are analyzing is

$$\bar{\theta}_f = \inf\left\{ \theta_f \in \mathbb{R} : (\hat{a}_\ell, \hat{s}_f) \text{ is SE given } \theta_f \right\},$$

which, of course, depends on the material payoffs. We derive an explicit expression for $\bar{\theta}_f$.

Let $\hat{\alpha}$ denote the corresponding profile of first-order beliefs. As in the proof of Proposition 4, recall that $(\bar{a}_\ell, \bar{s}_f)$ denotes the (necessarily pure) material-payoff SE strategy pair. Furthermore, $\hat{s}_f$ is a threat. Suppose that the leader deviates to the material-payoff SE action $\bar{a}_\ell$. The follower's maximal material payoff after $\bar{a}_\ell$ is $\pi_f(\bar{a}_\ell, \bar{s}_f(\bar{a}_\ell))$, while his payoff from the deterrence SE strategy $\hat{s}_f$ in response to $\bar{a}_\ell$ is $\pi_f(\bar{a}_\ell, \hat{s}_f(\bar{a}_\ell))$. The follower's frustration after $\bar{a}_\ell$ is therefore

$$F_f(\bar{a}_\ell; \hat{\alpha}_f) = \pi_f(\hat{a}_\ell, \hat{s}_f(\hat{a}_\ell)) - \pi_f(\bar{a}_\ell, \bar{s}_f(\bar{a}_\ell)) > 0. \tag{19}$$
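Since the threat $\hat{s}_f$ of the follower is sequentially rational (given $\theta_f$) after the deviation to $\bar{a}_\ell$, it must be the case that

$$u_f(\bar{a}_\ell, \hat{s}_f(\bar{a}_\ell); \hat{\alpha}_f) \geq u_f(\bar{a}_\ell, \bar{s}_f(\bar{a}_\ell); \hat{\alpha}_f)$$

and therefore that

$$\pi_f(\bar{a}_\ell, \hat{s}_f(\bar{a}_\ell)) - \theta_f F_f(\bar{a}_\ell; \hat{\alpha}_f)\, \pi_\ell(\bar{a}_\ell, \hat{s}_f(\bar{a}_\ell)) \geq \pi_f(\bar{a}_\ell, \bar{s}_f(\bar{a}_\ell)) - \theta_f F_f(\bar{a}_\ell; \hat{\alpha}_f)\, \pi_\ell(\bar{a}_\ell, \bar{s}_f(\bar{a}_\ell)).$$

Rearranging this expression and collecting terms, we have

$$\theta_f F_f(\bar{a}_\ell; \hat{\alpha}_f) \left( \pi_\ell(\bar{a}_\ell, \bar{s}_f(\bar{a}_\ell)) - \pi_\ell(\bar{a}_\ell, \hat{s}_f(\bar{a}_\ell)) \right) \geq \pi_f(\bar{a}_\ell, \bar{s}_f(\bar{a}_\ell)) - \pi_f(\bar{a}_\ell, \hat{s}_f(\bar{a}_\ell)).$$

The right-hand side of this expression is positive because $\bar{s}_f$ is the material-payoff SE strategy of the follower and hence maximizes $\pi_f(\cdot)$ after $\bar{a}_\ell$. This implies that $\pi_\ell(\bar{a}_\ell, \bar{s}_f(\bar{a}_\ell)) - \pi_\ell(\bar{a}_\ell, \hat{s}_f(\bar{a}_\ell))$ is also positive, since $\theta_f > 0$ and $F_f(\bar{a}_\ell; \hat{\alpha}_f) > 0$. Moving all terms besides $\theta_f$ to the right-hand side, we have

$$\theta_f \geq \frac{\pi_f(\bar{a}_\ell, \bar{s}_f(\bar{a}_\ell)) - \pi_f(\bar{a}_\ell, \hat{s}_f(\bar{a}_\ell))}{F_f(\bar{a}_\ell; \hat{\alpha}_f) \left( \pi_\ell(\bar{a}_\ell, \bar{s}_f(\bar{a}_\ell)) - \pi_\ell(\bar{a}_\ell, \hat{s}_f(\bar{a}_\ell)) \right)},$$

and substituting in the follower's frustration after the leader's deviation to the material-payoff SE action $\bar{a}_\ell$, we have

$$\theta_f \geq \frac{\pi_f(\bar{a}_\ell, \bar{s}_f(\bar{a}_\ell)) - \pi_f(\bar{a}_\ell, \hat{s}_f(\bar{a}_\ell))}{\left( \pi_f(\hat{a}_\ell, \hat{s}_f(\hat{a}_\ell)) - \pi_f(\bar{a}_\ell, \bar{s}_f(\bar{a}_\ell)) \right) \left( \pi_\ell(\bar{a}_\ell, \bar{s}_f(\bar{a}_\ell)) - \pi_\ell(\bar{a}_\ell, \hat{s}_f(\bar{a}_\ell)) \right)} = \bar{\theta}_f. \tag{20}$$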


That is, the inf is a min, by continuity of decision utility in $\theta_f$. With this, the proposition may be verified by inspection of the expression for $\bar{\theta}_f$.
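To make the inspection of (20) concrete, the following Python sketch evaluates its right-hand side for given payoffs. The function name, the argument names, and the numbers in the usage lines are hypothetical and chosen only for illustration; the sketch assumes integer (or `Fraction`) payoffs so that the arithmetic is exact.

```python
from fractions import Fraction


def deterrence_threshold(pi_f_path, pi_f_best, pi_f_threat, pi_l_best, pi_l_threat):
    """Evaluate the right-hand side of (20).

    pi_f_path   : follower's payoff on the deterrence path
    pi_f_best   : follower's payoff from his material best reply after the deviation
    pi_f_threat : follower's payoff from carrying out the threat after the deviation
    pi_l_best   : leader's payoff when the deviation is met by the best reply
    pi_l_threat : leader's payoff when the deviation is met by the threat
    """
    frustration = pi_f_path - pi_f_best          # as in (19)
    cost_of_punishing = pi_f_best - pi_f_threat  # follower's material loss from the threat
    harm_to_leader = pi_l_best - pi_l_threat     # leader's material loss from the threat
    if frustration <= 0 or harm_to_leader <= 0:
        raise ValueError("(20) requires positive frustration and positive harm")
    return Fraction(cost_of_punishing, frustration * harm_to_leader)


# Hypothetical numbers: the follower gets 2 on path, 1 by best-replying to the
# deviation, and 0 by punishing; the leader gets 3 if not punished and 0 otherwise.
baseline = deterrence_threshold(2, 1, 0, 3, 0)
higher_path_payoff = deterrence_threshold(3, 1, 0, 3, 0)
print(baseline, higher_path_payoff)
```

With these illustrative numbers the threshold is 1/3. Raising the on-path payoff from 2 to 3 doubles the frustration in the denominator and halves the threshold to 1/6.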
A.7. Proof of Proposition 6


Let $\hat{\sigma} = (\hat{a}_\ell, \hat{s}_f)$ be a pure SE strategy pair with SA/ABB or ABI of an LF-game with no relevant ties, and let $\hat{\alpha}$ denote the corresponding profile of first-order beliefs. First note that the consistency condition of SE implies that deviations from $\hat{a}_\ell$ would be rated by $f$ as unintended mistakes, hence $f$ would not be angry under ABI. Therefore, the pure SE with ABI is unique and it coincides with the material-payoff SE $\bar{\sigma}$. Next we consider SA/ABB and we prove the result by contraposition, that is, we show that if path $(\hat{a}_\ell, \hat{s}_f(\hat{a}_\ell))$ differs from the material-payoff equilibrium path $(\bar{a}_\ell, \bar{s}_f(\bar{a}_\ell))$, then there is a threat.

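As the leader cannot be frustrated at the root, and the follower cannot be frustrated after $\hat{a}_\ell$, which is expected with probability 1, we must have $\hat{a}_\ell = r_\ell(\hat{s}_f)$ and $\hat{s}_f(\hat{a}_\ell) = \arg\max_{a_f \in A_f(\hat{a}_\ell)} \pi_f(\hat{a}_\ell, a_f) = \bar{s}_f(\hat{a}_\ell)$. Suppose $(\hat{a}_\ell, \hat{s}_f(\hat{a}_\ell)) \neq (\bar{a}_\ell, \bar{s}_f(\bar{a}_\ell))$. Then $\hat{a}_\ell \neq \bar{a}_\ell$ and

$$\pi_\ell(\hat{a}_\ell, \bar{s}_f(\hat{a}_\ell)) = \pi_\ell(\hat{a}_\ell, \hat{s}_f(\hat{a}_\ell)) > \pi_\ell(\bar{a}_\ell, \hat{s}_f(\bar{a}_\ell)), \tag{21}$$

because $\hat{a}_\ell = r_\ell(\hat{s}_f)$. On the other hand, since $(\bar{a}_\ell, \bar{s}_f)$ is the unique material-payoff equilibrium,

$$\pi_\ell(\bar{a}_\ell, \bar{s}_f(\bar{a}_\ell)) > \pi_\ell(\hat{a}_\ell, \bar{s}_f(\hat{a}_\ell)).$$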


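Therefore,

$$\hat{s}_f(\bar{a}_\ell) \neq \bar{s}_f(\bar{a}_\ell) = \arg\max_{a_f \in A_f(\bar{a}_\ell)} \pi_f(\bar{a}_\ell, a_f),$$

that is, the follower is not choosing his material-payoff best response. This can happen in an SE with SA/ABB only if the follower is frustrated after $\bar{a}_\ell$, that is,

$$F_f(\bar{a}_\ell; \hat{\alpha}_f) = \pi_f(\hat{a}_\ell, \hat{s}_f(\hat{a}_\ell)) - \max_{a_f \in A_f(\bar{a}_\ell)} \pi_f(\bar{a}_\ell, a_f) = \pi_f(\hat{a}_\ell, \hat{s}_f(\hat{a}_\ell)) - \pi_f(\bar{a}_\ell, \bar{s}_f(\bar{a}_\ell)) > 0.$$

Taking into account that $\hat{s}_f(\hat{a}_\ell) = \bar{s}_f(\hat{a}_\ell)$, this shows that Condition 2 of Definition 4 of threat holds. Furthermore, the material-payoff SE condition for the leader and (21) yield

$$\pi_\ell(\bar{a}_\ell, \bar{s}_f(\bar{a}_\ell)) > \pi_\ell(\hat{a}_\ell, \bar{s}_f(\hat{a}_\ell)) = \pi_\ell(\hat{a}_\ell, \hat{s}_f(\hat{a}_\ell)) > \pi_\ell(\bar{a}_\ell, \hat{s}_f(\bar{a}_\ell)).$$

Therefore Condition 1 holds as well.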
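The two inequalities that this proof establishes can be checked directly in a finite LF-game. The Python sketch below is an illustration under stated assumptions: Definition 4 is not restated in this section, so Conditions 1 and 2 are encoded as the inequalities derived in the proof above (the leader is hurt by the threat after his material-payoff SE action, and the follower does better on the path induced by the threat than on the material-payoff SE path). The game `GAME` and all function names are hypothetical, and ties are assumed away as under NRT.

```python
# Hypothetical LF-game (not from the paper): GAME[a_l][a_f] = (pi_l, pi_f).
GAME = {
    "fair":   {"accept": (2, 2), "reject": (0, 0)},
    "greedy": {"accept": (3, 1), "reject": (0, 0)},
}


def material_best_replies(game):
    """Follower strategy that maximizes his material payoff after each a_l."""
    return {
        a_l: max(replies, key=lambda a_f: replies[a_f][1])
        for a_l, replies in game.items()
    }


def leader_best_reply(game, s_f):
    """Leader action that maximizes his material payoff against strategy s_f."""
    return max(game, key=lambda a_l: game[a_l][s_f[a_l]][0])


def is_threat(game, s_hat):
    """Check the two inequalities used in the proof (assumed reading of Definition 4)."""
    s_bar = material_best_replies(game)
    a_bar = leader_best_reply(game, s_bar)  # material-payoff SE action
    a_hat = leader_best_reply(game, s_hat)  # leader's best reply to the candidate threat
    # Condition 1: the threat lowers the leader's payoff after a_bar.
    condition_1 = game[a_bar][s_bar[a_bar]][0] > game[a_bar][s_hat[a_bar]][0]
    # Condition 2: the follower prefers the induced path to the material-payoff SE path.
    condition_2 = game[a_hat][s_bar[a_hat]][1] > game[a_bar][s_bar[a_bar]][1]
    return condition_1 and condition_2


punish_greedy = {"fair": "accept", "greedy": "reject"}
print(is_threat(GAME, punish_greedy))
print(is_threat(GAME, material_best_replies(GAME)))
```

In this example the strategy that rejects after `greedy` passes both checks, while the material best-reply strategy itself does not, since it leaves the leader's payoff after his material-payoff SE action unchanged.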